← Home ← Back to /g/

Thread 106193870

315 posts 232 images /g/
Anonymous No.106193870 [Report] >>106193885 >>106197276
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106190450

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.2 Guide: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
ポストカード !!FH+LSJVkIY9 No.106193885 [Report] >>106193920 >>106193932
>>106193870 (OP)
b l e s s e d
t h r e a d
o f
f r e n z o n e ;3
Anonymous No.106193897 [Report]
>he believed
Anonymous No.106193901 [Report]
3rd for RAGE attention
Anonymous No.106193913 [Report]
>he brapped
grim
Anonymous No.106193914 [Report]
4st for RAPEAPE
ポストカード !!FH+LSJVkIY9 No.106193920 [Report] >>106193932 >>106194632
>>106193885
stop reportbombing my posts
i generated that cutie for a user here
how can he find her is she is balate?!


>https://files.catbox.moe/a1qv37.mp4
Anonymous No.106193930 [Report]
fingers and emma seem ok with annealed
ポストカード !!FH+LSJVkIY9 No.106193932 [Report]
>>106193920
>>106193885
SMELL YA LATER<3
:3
Anonymous No.106193934 [Report] >>106194041
gen jam?
Anonymous No.106193945 [Report] >>106193958
>real schizo hours
ポストカード !!FH+LSJVkIY9 No.106193958 [Report] >>106193983
>>106193945
its 1pm?
Anonymous No.106193983 [Report] >>106194012
>>106193958
real schizos never stop
Anonymous No.106194008 [Report] >>106194021
Yet another ComfyUI option that isn't default set to this making the ui everything but comfy
Anonymous No.106194010 [Report]
riggi penaltini
ポストカード !!FH+LSJVkIY9 No.106194012 [Report] >>106194026 >>106194033 >>106194632 >>106194731
>>106193983
stop what? mass-reporting? acting in bad faith? LARPing?
being an all around nuisance?
what do they never stop? (besides fapping)

enlighten me
Anonymous No.106194017 [Report]
anime girl opens a box of pizza and eats a slice.
Anonymous No.106194021 [Report] >>106194050
>>106194008
and max ui fps is limited to 120 at max, lmao
ポストカード !!FH+LSJVkIY9 No.106194026 [Report] >>106194040
>>106194012
its not supposed to be that serious mate
but he MAKES it this way;
now i HAVE to be here
because he doesn't
want it to happen;
its a mexican -
s t a n d o f f
;3
Anonymous No.106194033 [Report]
>>106194012
schizoing... and... posting mikus i guess...
Anonymous No.106194040 [Report] >>106194056
>>106194026
why did you post "https://desuarchive.org/g/thread/106190450/"
i dun get it
nta btw
ポストカード !!FH+LSJVkIY9 No.106194041 [Report]
>>106193934
it sadly seems gover ;c
if we get tardposters corrupting the categories (intentionally) it ruins it for everyone else

gen jam should be ONE topic
and one chosen by the wheel
Anonymous No.106194050 [Report]
>>106194021
And also no matter what the limit is, the gpu seems to work ~5% harder if there is a limit on rendering the ui even at 30fps rather than if its uncapped at 165hz/fps
ポストカード !!FH+LSJVkIY9 No.106194056 [Report] >>106194074
>>106194040
>post got report-bombed
>i didnt even notice until he asked for 'catbox'
:c
Anonymous No.106194074 [Report] >>106194140 >>106194457
>>106194056
i dont get it, why are you posting a desuarchive link of previous op inside previous op?
Anonymous No.106194081 [Report] >>106194140
can i get a /ldg/ discord welfare check? these threads are getting slow...
Anonymous No.106194093 [Report] >>106194111 >>106196911
Anonymous No.106194111 [Report] >>106194306
>>106194093
Anonymous No.106194122 [Report]
ポストカード !!FH+LSJVkIY9 No.106194140 [Report] >>106194181 >>106194600
>>106194081
>>106194074
so you can see which posts were removed anon
Anonymous No.106194153 [Report] >>106194206 >>106194207
>my pizza is cold, you bitch!
Anonymous No.106194181 [Report] >>106194202
>>106194140
one post that he didnt ask catbox for was removed aswell
are you a femboy?
ポストカード !!FH+LSJVkIY9 No.106194202 [Report] >>106194308 >>106194632
>>106194181
hello, i do not want my posts removed when they are on topic; if that happens you will get a desulink.

thank you for playing.
goodbye ;3

>good luck figuring out which collage-bait im cooking up this time muehehee
Anonymous No.106194206 [Report] >>106194217
>>106194153
looks great
Anonymous No.106194207 [Report]
>>106194153
better output:
ポストカード !!FH+LSJVkIY9 No.106194217 [Report] >>106194265
>>106194206
c-c-chroma does all t-this? anon-sama? ;_;
please do mai shiranui
Anonymous No.106194265 [Report]
>>106194217
are you a femboy
Anonymous No.106194290 [Report] >>106195051 >>106195140
Anonymous No.106194306 [Report] >>106194335
>>106194111
Anonymous No.106194308 [Report]
>>106194202
>>good luck figuring out which collage-bait im cooking up this time
glad to see the therapy is working :)
Anonymous No.106194326 [Report]
Anonymous No.106194335 [Report]
>>106194306
thank you
Anonymous No.106194340 [Report] >>106194398 >>106194442
WELCOME TO LONDON!
Anonymous No.106194364 [Report]
After testing it Chroma actually works well and stabilizes if you're using the flash lora.
Anonymous No.106194380 [Report] >>106194877 >>106196130
Reposting >>106194334
Here's seed in question (same problem with every seed I tried on that prompt)
https://files.catbox.moe/lvkdi9.png

>>106194345
I'm not aware of that
I haven't tried the annealed version yet
Anonymous No.106194386 [Report] >>106197867
Anonymous No.106194398 [Report]
>>106194340
anglo spotted
Anonymous No.106194437 [Report]
>when you see a white person in the UK:
Anonymous No.106194442 [Report]
>>106194340
Sadiq needs a part and parcel sign to go with this.
Anonymous No.106194447 [Report] >>106194536
Anonymous No.106194457 [Report]
>>106194074
not much of what he does makes sense, best to just ignore
Anonymous No.106194506 [Report] >>106194540
a man with a robotic arm lights a cigarette and smokes it.
Anonymous No.106194519 [Report] >>106194619
so is radial attention snakeshit or does it work
Anonymous No.106194536 [Report]
>>106194447
where are her organs
Anonymous No.106194540 [Report]
>>106194506
where did it come from, where did it go?
Anonymous No.106194547 [Report]
Anonymous No.106194548 [Report]
Anonymous No.106194550 [Report] >>106194588
how does he get so much smoke from a cig
Anonymous No.106194568 [Report]
whats the difference between chroma annealed and flash-heun?
Anonymous No.106194574 [Report] >>106194710
Fake VACE
>Fake VACE

https://huggingface.co/CCP6/FakeVace2.2
Anonymous No.106194583 [Report] >>106195099
Anonymous No.106194588 [Report]
>>106194550
because vaping was probably also in the training data
Anonymous No.106194600 [Report]
>>106194140
this is a good example of what they're taking about
https://desuarchive.org/g/thread/105895325/
Anonymous No.106194619 [Report]
>>106194519
It does work but it's got annoying resolution restrictions.
Anonymous No.106194632 [Report]
>>106193920
>>106194012
>>106194202
its not that serious mate ;3
Anonymous No.106194710 [Report] >>106194851
>>106194574
>vibe coded
>2 days ago
>no example outputs
downloading now
Anonymous No.106194730 [Report] >>106194759
>the jokes are over, now it's your turn
Anonymous No.106194731 [Report]
>>106194012
you have decided to be a bad doggo
Anonymous No.106194742 [Report]
The default settings for Diffusion pipe may be incorrect for Chroma v50, loras are broken when training Chroma v50 (HD)

Any ideas?
Anonymous No.106194759 [Report]
>>106194730
it's good he doesn't shoot the camera because then there'd be no footage
Anonymous No.106194812 [Report]
pistol
>laugh, you cunt
Anonymous No.106194813 [Report]
I don't tend to notice it as much anymore but there's a lot of wayward lavalier mics in videos
Anonymous No.106194837 [Report] >>106195099
Anonymous No.106194851 [Report]
>>106194710
kek, I'm just waiting for real vace. wan 2.2 fun is already out, quality looks decent but its still massive in file size
Anonymous No.106194865 [Report]
not bad
Anonymous No.106194872 [Report]
still works
Anonymous No.106194877 [Report] >>106195087
>>106194380
>>106194363

No luck with annealed. I'll reword the prompt to see if it improves
Anonymous No.106194878 [Report] >>106194910 >>106194929 >>106194950
https://civitai.com/models/1598938?modelVersionId=1967502

Forgecucks? Now you don't have excuse!
Anonymous No.106194883 [Report] >>106195099
Anonymous No.106194910 [Report]
>>106194878
Loras?
Anonymous No.106194929 [Report]
>>106194878
ControlNet?
Anonymous No.106194931 [Report] >>106194942 >>106194984 >>106195500
a man holds up a sign saying "LDG".
Anonymous No.106194942 [Report]
>>106194931
he sure does
Anonymous No.106194950 [Report] >>106194985
>>106194878
but it's just custom nodes that I have to bloat even more? who does this actually help if it just adds to the amount of shit that can potentially break in comfy?
Anonymous No.106194951 [Report]
How do you prompt for camera angles in Chroma? Any examples of how I should do it? Right now it's just ignoring anything I tell it to do in that regard.
Anonymous No.106194984 [Report] >>106196283 >>106196554
>>106194931
a man drinks a beer from a beer can on his desk.

neat, it worked
Anonymous No.106194985 [Report]
>>106194950
STOP USING FORGE RIGHT NOW!
YOU ARE NOT COLLABORATING!
Anonymous No.106195010 [Report]
a man throws his PC monitor on the floor.
Anonymous No.106195017 [Report] >>106195095
>after using chroma from months i hear that you should use "aesthetic 11" in positive and "aesthetic 0" in negative
Anonymous No.106195051 [Report]
>>106194290
that's cool
Anonymous No.106195069 [Report] >>106195124
'night and happy latent space dreams
Anonymous No.106195087 [Report] >>106195126 >>106195141 >>106195187 >>106195223 >>106195231 >>106196030
>>106194877
Okay, so from what I understand the Chroma dev shrank the dataset size for this version. Since now if I remove the neg entirely, I get a very fuzzy looking image for that prompt even when asking for a photograph. So my guess is that I need a much stronger neg (but I tried a bunch of new words and it barely improved), and the v50/v50 annealed are strongly biased towards drawings. Here is one of my usual footfag prompts I test with no changes in neg. Yeah, that is exactly what is happening.
Anonymous No.106195095 [Report]
>>106195017
I doubt it matters by might as well try
Anonymous No.106195097 [Report] >>106195116
hm. either I suddenly can't prompt for shit or there's something wrong with the final chromas. i've already swapped to the new vae.
Anonymous No.106195099 [Report]
>>106194883
>>106194837
>>106194583
>oh yes, Iodestones, I'm going to download your shitty model to do the same thing SDXL does, but it takes 10 times longer!
Anonymous No.106195116 [Report]
>>106195097
>new vae
There's a new vae for it?
Anonymous No.106195124 [Report]
>>106195069
nice
Anonymous No.106195126 [Report]
>>106195087
wow, left has the "chroma look" that people have been so optimistic about, while right looks like it wants to converge on the standard blown out slop look. eww.
Anonymous No.106195140 [Report]
>>106194290
no, that's really cool
enjoying fluid motion, interactions, shades so much
Anonymous No.106195141 [Report] >>106195264
>>106195087
Can you give me the deets on that gen so I can play around with it? Catbox would be cool, too.
The v50 version looks rather disappointing.
I'm still running the scheduler/sampler tests right now and there's some promise in there, for sure, though.
Anonymous No.106195147 [Report] >>106195237
don't make me do a single vote survey per ip to see who uses chroma...
Anonymous No.106195187 [Report]
>>106195087
Is this normal v48 or v48-detail-calibrated?
Anonymous No.106195204 [Report]
Anonymous No.106195209 [Report] >>106195258
the man holds up a plate with a McDonalds cheeseburger and McDonalds fries on it.

reviewbrah could AI generate his videos and just collect the money desu
Anonymous No.106195211 [Report]
just bought 128GB of RAM
that means tomorrow they will release Wan 2.3 which requires 256GB
Anonymous No.106195223 [Report] >>106195476
>>106195087
I find that v49 gives me the best results, which kind of makes sense since it's the one trained at 1024 resolution without any merges against other highres experiments.
Anonymous No.106195231 [Report] >>106195250 >>106195316 >>106195380 >>106195820 >>106197310
>>106195087
You guys still don't get it do you? The classic "Chroma look" of the left image that some people like, is solely due to the fact that the model was trained at 512 res but you're genning at 1024. This manifests as essentially additional noise added at all stages of the diffusion process. In the low noise timesteps, the model diffuses this additional noise into realistic-looking texture rather than something that looks overly smooth.

The right is the "correct" version. It matches what most of the images in the training data look like. The same process that is making the left image look realistic also makes it fuck up anatomy and have random details in the image that don't make sense. You just prefer the look it has on the textures, without realizing all the other aspects of the image that it's hurting.

Chroma v50 has much better anatomy and more consistent details than any previous version, because it's actually trained at the resolution people gen at. The solution to textures that are too smooth is a carefully curated aesthetic finetune that avoids any slopped looking training images.
Anonymous No.106195237 [Report]
>>106195147
why does it matter tho
Anonymous No.106195250 [Report]
>>106195231
NTA but can you post a better comparison showing that v50 is in fact superior
Anonymous No.106195258 [Report]
>>106195209
the cheeseburger is smaller than it seems!
Anonymous No.106195264 [Report] >>106195315 >>106195316 >>106195372
>>106195141
Sure, here's gen anon
https://files.catbox.moe/bip9ro.png

Keep in mind the fake skin issue shows up other places in other prompts as well, but those are isolated cases (pic rel), I've never seen it this bad. I've tried that footfag prompt consistently across every Chroma version and this is the first time I've seen a slopped result.
Anonymous No.106195312 [Report] >>106195357 >>106195390 >>106195419
I migrated my 3090fe to a 5090fe, and the difference in speed is simply insane.

For the following conditions on wan2.2 i2v (fp8) :
3090@260W
5090@460W
720p/113 frames/4+5 steps/cfg 1
3090: ~25 min
5090: ~6 min

Power limiting the 5090 at 80% gives me 93% of the performances from my tests. So I just limited to that, it's summer and the nvidia melting cable thing is still in my mind.

I also did some tests for block swapping in wan2.2 (fp8) text to image.
211.67 seconds 30 blocks swapped
200.14 seconds 10 blocks swapped
193.55 seconds 0 blocks swapped
The difference is pretty small, surprising to see the model not being fully in vram having this little of an impact.
I guess it means I can have 40 swapped blocks + long 161 frames videos for example on wan without having to get a 48GB+ card once the repetition problem is solved.
Anonymous No.106195315 [Report] >>106195464
>>106195264
post v48/v50 version of the Filipino slut
Anonymous No.106195316 [Report] >>106195820
>>106195264
Thanks brother. Appreciate it.
I'm at 12/17 sampler/scheduler plots now and I'll run some tests on your setup afterwards.
>>106195231
From my understanding, that would imply that we could get a similar result genning at a bigger resolution again, due to the additional noise we'll be gaining again, right?
Anonymous No.106195327 [Report] >>106195412
the man takes a slice of pizza and eats it.

using the 2.2 i2v lora, pretty good desu
Anonymous No.106195357 [Report] >>106195455
>>106195312
very nice, thanks for posting these tests
crazy speedup
..however
>460w
dang, my whole rig consumes that much
>I guess it means I can have 40 swapped blocks + long 161 frames videos for example on wan without having to get a 48GB+ card once the repetition problem is solved.
have you tried radial attention?
Anonymous No.106195372 [Report]
>>106195264
Nta
Hey man, try this workflow instead, slight difference but I had success with having less of the grain with it I think
https://files.catbox.moe/d03j58.json
Anonymous No.106195380 [Report]
>>106195231
OK, so why does the left image obviously have much more detail? Look at the hair!
Anonymous No.106195390 [Report] >>106195420 >>106195455
>>106195312
but the fp8 quality sacrifice is large, q8 is almost full quality
Anonymous No.106195396 [Report]
Anonymous No.106195406 [Report] >>106195448
I finally found out why my prompt had so little effect on 2.2 image to video: setting the cfg to >1 solved the problem, suddenly my prompts were having quite the impact.
I thought this wasn't a requirement anymore but it is, which is very annoying since it almost doubles my gen times.
Anonymous No.106195412 [Report]
>>106195327
this time, less impressed:

the man throws the pizza out the window behind him, and shakes his head in disgust.
Anonymous No.106195419 [Report] >>106195495
>>106195312
just wait till jenga, radial attention and sage 3 fixed drops, speed be nutty
Anonymous No.106195420 [Report] >>106195480 >>106195486
>>106195390
NTA but I always thought FP8 had more quality than Q8. Is it the other way around?
Anonymous No.106195438 [Report]
Even though it can sometimes want to gen cartoony images, there is some body horror, and there's a little gacha like with all other models, Chroma was always good enough for a lot of things, picrel v48.

From initial testing the new versions do seem to be better too but I'm genning something else now so I can't test directly.
Anonymous No.106195448 [Report]
>>106195406
works on my machine
Anonymous No.106195455 [Report] >>106195486 >>106195505 >>106195630 >>106195689
>>106195357
>very nice, thanks for posting these tests
No problem, it's the stuff I wish more anons posted so I might as well do it.

>have you tried radial attention?
No, is it supposed to solve that?

>>106195390
I don't think it really matters when quite the sacrifice was made when I started using lightx2v.
And I'm using scaled fp8 which is like q8 and close to fp16 from what I understand.
Anonymous No.106195464 [Report] >>106195496 >>106195734 >>106196375 >>106196487
>>106195315
She's meant to be Japanese. Anyways, I think Chroma v48 does slop quite a few of those as well, as do other versions (though I recall it happening less on v36), but every now and then there's realistic looking gen.

That being said, with v50 I've not gotten an unslopped result with that prompt. Here's catbox to check performance across different seeds yourself
https://files.catbox.moe/qu35uz.png
Anonymous No.106195467 [Report] >>106195525
why do image to video first frames always look so bad/weird?
Anonymous No.106195476 [Report]
>>106195223
Trying v49 now
Anonymous No.106195480 [Report]
>>106195420
With fp8 scaled there's no difference in quality, with normal fp8, q8 is better quality, but it will always be slower than fp8 / fp8 scaled
Anonymous No.106195483 [Report]
I like to prototype larger prompts with changing parameters, but I found the solutions for wildcards too complicated for quick use. I made this myself, but are there any decent solutions that make it just as easy for me with more functionality?
Anonymous No.106195486 [Report] >>106195505 >>106195509 >>106195630 >>106195689
>>106195455
>scaled fp8 which is like q8
Would like to see that comparison
>>106195420
picrel
Anonymous No.106195495 [Report]
>>106195419
Yeah but hopefully the quality won't be too much.
I know sageattention 3 speedup is constrained by looking noticeably worse compared to sage2++.
Anonymous No.106195496 [Report]
>>106195464
Try v49, it's the version trained at 1024 without any weird merges
Anonymous No.106195500 [Report]
>>106194931
Anonymous No.106195505 [Report] >>106195689 >>106195775
>>106195486
it is good and it is faster
see vidrel
>>106195455
>No, is it supposed to solve that?
yeah
Anonymous No.106195509 [Report] >>106195630 >>106195689
>>106195486
That's not fp8 scaled, that's an old comparison using plain fp8 from ages ago you downloaded from reddit, I know, I read the post back then as well
Anonymous No.106195510 [Report] >>106195721
Anonymous No.106195525 [Report]
>>106195467
I wish I knew, maybe it's the video creation shitting up something?
Either way, the first frame always look blurry to me to.
Anonymous No.106195531 [Report] >>106195565
Poll:

https://poal.me/z3ek04
https://poal.me/z3ek04
https://poal.me/z3ek04
https://poal.me/z3ek04
https://poal.me/z3ek04
Anonymous No.106195539 [Report] >>106195574 >>106195607
since you're talking about chroma, how do I get a realistic photo orc? I can only get it in 2d or 3d.
Anonymous No.106195565 [Report] >>106195580
>>106195531
v38 where?
Anonymous No.106195574 [Report]
>>106195539
Be really specific about it being a photo, about what camera is being used to take the photo, about how the photo was taken, etc. Do something like:
>A candid amateur photo taken with a Sony Cybershot
>A high-quality digital photo taken with a Fujifilm X100V using a Fujinon 23mm f-2 lens
Anonymous No.106195580 [Report]
>>106195565
best gen itt
Anonymous No.106195606 [Report] >>106195618
the man puts down the pizza on a table and covers his face with his palms.
>my day is ruined
Anonymous No.106195607 [Report]
>>106195539
Add photo, film still etc to your prompt, and if needed you could add things like: sketch, drawing, illustration, painting, art, cartoon

to your negatives, there is a LOT of control to be had with negatives
Anonymous No.106195618 [Report]
>>106195606
Anonymous No.106195630 [Report] >>106195689
>>106195486
>>106195509
>>106195455
Seems like scaled still takes a hit, but might be ok depending on what you want to gen

https://www.reddit.com/r/StableDiffusion/comments/1gc0wj8/sd35_large_fp8_scaled_vs_sd_35_large_q8_0_running/
Anonymous No.106195665 [Report] >>106196138
>Wanna jump with me anon ?
Anonymous No.106195689 [Report]
>>106195630
>>106195486
>>106195509
>>106195455
GRRRRRRRRRRRRRRRRRRRRRRR >>106195505
Anonymous No.106195720 [Report]
Anonymous No.106195721 [Report]
>>106195510
great gen, whats the prompt/lora for the digi photo look?
Anonymous No.106195724 [Report]
>im outta here
Anonymous No.106195734 [Report] >>106195773 >>106195788 >>106195986 >>106196138
>>106195464
Alright, v49 is downloading, but before that here's this prompt

>Amateur photograph of a stunning Japanese alt emo idol with noodle cup in her hand, she is in her couch, day time, bright, she is on fours, view from the front of her face, her soles are up in the air

Here v50 does look a little sharper, but I'm not sure if being more polished is what you want, after all Qwen images look clean as well but they are more slopped.
Anonymous No.106195764 [Report]
the man closes the pizza box. The top of the box says "LDG pizza" with a logo of Miku Hatsune on the box.

almost
Anonymous No.106195766 [Report] >>106195815
Anonymous No.106195773 [Report] >>106196018
>>106195734
holy esl what a prompt
Anonymous No.106195775 [Report] >>106195793
>>106195505
OK thanks anon.
Anonymous No.106195788 [Report] >>106195810 >>106195849 >>106195860 >>106195990 >>106196138
>>106195734
Alright, slightly altered the prompt to add panties to make it SFW. Much better results on this prompt overall, but win seems to go to v48. v49 is not bad though, will try other prompts
Anonymous No.106195793 [Report]
>>106195775
i was talking about FP8_scaled btw, ggufs are slower
Anonymous No.106195810 [Report]
>>106195788
slopped :(
Anonymous No.106195815 [Report] >>106195937
>>106195766
great gen anon
Anonymous No.106195820 [Report]
>>106195231
>>106195316
I think he is right. Changed the resolution from 1024 and the flux gens seem to have stopped.
Anonymous No.106195849 [Report]
>>106195788
Well, v49 is v48 + one epoch of 1024 resolution training

v50 and v50 annealed are frankenstein merges with other high resolution training tests, they might be better for some things, but I haven't come across any yet
Anonymous No.106195860 [Report] >>106196364
>>106195788
It seems that subsampling the dataset obliterated the organic look Chroma had
Anonymous No.106195883 [Report]
ok, there's a miku
Anonymous No.106195899 [Report]
>W-who are you ? And what do you mean with /ldg/ ?
Anonymous No.106195937 [Report]
>>106195815
thanks
Anonymous No.106195986 [Report] >>106196018 >>106196130
>>106195734
your prompt is shit.

shit goes in - shit comes out
Anonymous No.106195990 [Report]
>>106195788
try higher than 1024 res for the 49/50
Anonymous No.106196009 [Report] >>106196018
is unipc still the recommended for wan2.2?
Anonymous No.106196018 [Report] >>106196052 >>106196056
>>106195773
>>106195986
bros wtf am i supposed to be prompting??
i dont want to pay for grok just to gen with chroma
>>106196009
try lcm and see which u like better
Anonymous No.106196019 [Report] >>106196530
VACE 2.2 when?
Anonymous No.106196030 [Report] >>106196066 >>106196138
>>106195087
v49
Anonymous No.106196043 [Report] >>106196056
Hello
Anonymous No.106196052 [Report] >>106196070
>>106196018
you can literally use chatgpt for free you iliterate twat.

> in her couch
IN her couch?! what.

> is on fours
on what fours? it's "she is on all fours" but this still doesn't make sense because you say her feet are up, retard.

honestly. 99% of issues is always the person prompting, not the model.
Anonymous No.106196056 [Report] >>106196116
>>106196043
>>106196018
can you show that anon how you properly prompted for a jap woman
Anonymous No.106196066 [Report]
>>106196030
Why did you enlarge it ? I can see the fucking pixels
Anonymous No.106196070 [Report]
>>106196052
though the iliteracy and obvious laziness make sense, given he's a footfag
Anonymous No.106196116 [Report] >>106196129
>>106196056
just use any free llm, chroma needs lots of yapping i think

>Amateur snapshot of a striking Japanese alt-emo idol with vibrant dyed hair and dramatic makeup. She's positioned on all fours on a cozy couch during bright daytime lighting, with her face directly facing the camera and soles of her feet prominently visible in the air. Her delicate hands clutch a steaming instant noodle cup with chopsticks. Soft natural light streams through windows, creating gentle highlights on her porcelain skin. Background features cozy domestic setting with throw pillows and blankets. Style: candid amateur photography with shallow depth of field, slightly overexposed bright tones, and intimate framing. Emphasis on youthful energy, kawaii aesthetic meets alternative fashion, and playful domestic moment. High-detail capture, authentic snapshot quality

copypaste this prompt https://github.com/QwenLM/Qwen-Image/blob/45bccb917f87c40fb557a4f0d5bff8350047332c/src/examples/tools/prompt_utils.py#L42
Anonymous No.106196129 [Report] >>106196181
>>106196116
you might have the tism
Anonymous No.106196130 [Report] >>106196153
Prompt
>Amateur photograph, a stunning Japanese female cosplaying as sailor, sitting in front of a restaurant at night, she is holding a drink with left hand, and food with right hand, she is candidly laughing

neg
>3D, render, drawing, vintage, bokeh

https://files.catbox.moe/qbpnxq.png

With that, I conclude v48 is best for photorealism, yeah.

>>106195986
I know it's shit, but no amount of prompt engineering is going to fix the sloppiness, look at >>106194380 it was VLM generated, and the output looks even worse with no negs.
Anonymous No.106196138 [Report] >>106196163 >>106196177 >>106196180
>>106196030
>>106195788
>>106195734
>>106195665
All this talk about Chroma gens reinforces my argument that what you're proposing with your blatant shilling doesn't bring anything new to the local ecosystem or NSFW. All these images can be created with 10 times better quality in SDXL with any checkpoint finetuned for realism, much more quickly and easily and less heavier.
Stop wasting time with this crap.
Anonymous No.106196153 [Report] >>106196188
>>106196130
This could be do with SD 1.5
Anonymous No.106196162 [Report]
Anonymous No.106196163 [Report] >>106196225
>>106196138
Then use your SDXL, nobody is stopping you

Better yet, post your SDXL gens to convince people

Just stop bitching about Chroma like an absolute mental case
Anonymous No.106196177 [Report]
>>106196138
Go anon.

Show me your SDXL versions. I will make it easier for you, I will not post a catbox that shows your obvious lying.
Anonymous No.106196180 [Report]
>>106196138
tell me more about this sdxl
Anonymous No.106196181 [Report]
>>106196129
you definitely have sickle cell anemia
Anonymous No.106196185 [Report] >>106196199 >>106196203 >>106196206 >>106196227
Fix v50 lora. My dataset sucks though.
Anonymous No.106196188 [Report]
>>106196153
I mean, if anon can really do this with SDXL or 1.5, I want to see it.
Anonymous No.106196199 [Report]
>>106196185
I need her.
Anonymous No.106196203 [Report] >>106196227
>>106196185
Whoops wrong gen. Meant to post this one.
Anonymous No.106196206 [Report]
>>106196185
it has that 90's tv movie vibe going
Anonymous No.106196225 [Report] >>106196246
>>106196163
Not explaining myself to you, backed by entire AI sites. You're wasting your time, faggot. Any thing you share about new chroma version are worst and worst. You are lost, unable to fix your product.
Anonymous No.106196227 [Report]
>>106196185
>>106196203
w-whut does she look like in leggings or perhaps panties tho
Anonymous No.106196230 [Report] >>106196277
Some of my chroma 50 gens come out super blurry, what's up with that? Never had that before.
Anonymous No.106196246 [Report] >>106196282 >>106196316
>>106196225
Your whole life is focused on Chroma failing, like how sad is your existence ? You can just not use it, how can you be this obsessed ?
Anonymous No.106196268 [Report]
Imagine caring about what models a random anon may or may not use
Anonymous No.106196277 [Report]
>>106196230
are you using a gguf from the reddit post? some of them had that issue and got deleted later
Anonymous No.106196282 [Report]
>>106196246
Because he can't run Chroma. He's never posted a gen either. He's just a troll
Anonymous No.106196283 [Report]
>>106194984
his muscles were just smoke
he is fat as fuck
Anonymous No.106196316 [Report] >>106196348
>>106196246
You are shilling a product, dev, expect reviews good and bad. And here is a bad one(or mayne a real one not made by you), deal with it. The quality is dropping, your posts show it and for every fix you attempt ten things worsen. You are lost, you don't know what to do to improve your product.
Anonymous No.106196324 [Report] >>106196337 >>106196397
I appreciate chroma and qwen but the anime models simply have more sovl than them, even the most random shitmix
Anonymous No.106196330 [Report]
lodestone should be dragged out on the street and shot
Anonymous No.106196337 [Report]
>>106196324
That's a very compelling example, anon.
Anonymous No.106196348 [Report] >>106196382 >>106197059
>>106196316
kek, I'm not the 'dev' of Chroma, your mind is fractured, in fact I'm in no way affiliated with Chroma or any other model, unless you count visiting their discord channels
Anonymous No.106196364 [Report] >>106196372
>>106195860
Didn't entirely eliminate it, but it made it very less likely that you do get an organic looking gen.
Anonymous No.106196372 [Report]
>>106196364
In fact, if you train a realism LoRA, you probably wouldn't notice this issue.
Anonymous No.106196375 [Report]
>>106195464
Anonymous No.106196382 [Report]
>>106196348
arguing here is pointless. there are far too many mentally unstable people here to hold an actual conversation or god forbid a real argument that doesn't devolve into "nu uh".

i will continue to use chroma just to make the idiots suffer.
Anonymous No.106196397 [Report] >>106196617
>>106196324
Anonymous No.106196450 [Report]
lodestone it is time for the qwen finetune please
Anonymous No.106196462 [Report]
Can anyone explain the process or the workload of integrating image loras into a wan node tree?

I am running wan2.2 i2v, but sometimes a character in the input image has their eyes closed, which means their eye information is lost. When this is the case, I adjust the prompt to make sure their eyes remain closed for the whole video.

Assuming there was a good quality illustrious lora for the character in question, could i integrate it and somehow let wan know that a specific character from the input image should inherit lora data from a specific lora? Maybe something like how region mapping + controlnet works with masking, except for i2v videos.
Anonymous No.106196472 [Report]
what I remember most from the SD to HD transition was how many actors' careers didn't survive it. brad pitt's craggy face didn't hold up well, while catherine zeta jones's uninteresting perfection became a showcase display for the benefits of the new medium.
the results I'm getting with v50 remind me of that. what looked pretty good two epochs ago with the 512px beer goggles on now highlights every liverspot and cheesy toenail. it's like the "gross-ups" from ren & stimpy.
I'm haven't cracked myself up this much since chroma first launched. what a fucking hoot.
Anonymous No.106196474 [Report]
>That's a man, baby
Would you anon, knowing it's a dude ?
Anonymous No.106196487 [Report]
>>106195464
good testing anon, I'll stick with v48
Anonymous No.106196498 [Report]
how do you anons manage to use lightx2v without destroying obvious details and motion like hair moving in wan2.2? or even faces for that matter
I can't find a way to make it less horrible in impact
Anonymous No.106196511 [Report]
what's the deal with this qinglong business? I usually use sigmoid if I need fine details
I wish testing this stuff properly didn't take hours to days
Anonymous No.106196530 [Report]
>>106196019
I NEED IT
Anonymous No.106196554 [Report]
>>106194984
the more he is animated the more he looks like george michael.
Anonymous No.106196575 [Report] >>106196588 >>106196627 >>106196730
what a SLVT
Anonymous No.106196588 [Report]
>>106196575
Wholesome chungus!
Anonymous No.106196617 [Report]
>>106196397
Anonymous No.106196627 [Report]
>>106196575
this is a children's board oh noes
Anonymous No.106196638 [Report] >>106196662
"can your sd1.5 model do that"
Anonymous No.106196662 [Report]
>>106196638
that's just your average anons feet
Anonymous No.106196705 [Report]
can 2.2 do 720p yet?
Anonymous No.106196722 [Report] >>106196792
>loving wan
>think gens are smooth af
>best of the best
>trying to rip pic off insta for img2prompt
>notice its ai genned, view desc
>#midjourney
>havent seen that in a while, lets see where its at now
>new video model
>absolutely gorgeous and best ai vids ive seen so far

bleak for local
Anonymous No.106196730 [Report]
>>106196575
Anonymous No.106196748 [Report]
local diffusion general?
Anonymous No.106196765 [Report] >>106196860 >>106197032 >>106197065 >>106197094
> A hyper-realistic POV shot, grainy imperfect 2000s CCTV footage, from a ceiling-mounted security camera inside a brightly lit grocery store. A large male cat head is looking at a female customer. Fluorescent halogen lights cast cold bluish tones.The camera is stationary, with a subtle VHS-style time-stamp overlay like 'CAMERA 02 2009/04/17 14:32:07'. No dramatic music, just low ambient hum. Aspect ratio 4:5 vertical. Color grading should match old security footage with muted, slightly greenish hues.
Anonymous No.106196792 [Report] >>106196819 >>106196901 >>106196993 >>106197179
>>106196722
Nah Hailuo 02 is superior to that slop. It's the best video model ever made

https://youtu.be/5yI9wEys2dc?t=251
Anonymous No.106196819 [Report] >>106196832
>>106196792
wow you werent kidding, this can seriously be used professionally wtf
Anonymous No.106196832 [Report] >>106196854 >>106197061
>>106196819
Yeah, China shitted on Veo 3 a while back, it's wild

https://xcancel.com/WuxiaRocks/status/1935298213613027521#m
Anonymous No.106196854 [Report]
>>106196832
> june 18th
the chinks are fucking cracked wtf?? the reflections and physics are perfect AND has sound????
Anonymous No.106196860 [Report]
>>106196765
Anonymous No.106196893 [Report] >>106196906
Harry Potter if he anime
Anonymous No.106196901 [Report]
>>106196792
the chinaman is on FIRE, goddamn! when local?
Anonymous No.106196906 [Report] >>106197072
>>106196893
a redditor faggot already did it. both this and the other are completely shit
Anonymous No.106196911 [Report]
>>106194093
Catbox?
Anonymous No.106196979 [Report]
Chroma is a subpar model for realism and styles. It has deformed feet and hands, poor proportions, and zero community support. Why use a mediocre model when you can get better results faster with embeddings or any SDXL model? Tags are better.
Anonymous No.106196983 [Report]
Anonymous No.106196993 [Report]
>>106196792
but the boring minimax, doesn't even sell infinite usage for Hailuo 2. I definitely prefer the free wan,kek
Anonymous No.106196997 [Report] >>106197000 >>106197003
when will the wan chinks fix the overtalking problem? things getting a little weird now.

https://litter.catbox.moe/h33yt9eyfauh53fu.mp4
Anonymous No.106197000 [Report]
>>106196997
kek
Anonymous No.106197002 [Report]
Is there a way to replicate the dynamic calculation of SNR to switch from the HIGH wan model to the LOW one like the official repo does?
Anonymous No.106197003 [Report]
>>106196997
A trick I've been trying is do a vace vid 2 vid gen with multitalk but give it a silent audio clip. Still sucks you gotta do that though.
Anonymous No.106197032 [Report]
>>106196765
kek
Anonymous No.106197052 [Report] >>106197068
The flan-t5xxl changes things too much
Anonymous No.106197059 [Report]
>>106196348
Your product has no market, no target, no audience, no strengths, and is below average.
Here and out there, people use SDXL and WAN. Furries and bronies stick with it too.
Shilling here and replying to my posts shows your failure and your product's failure.
Call me what you want, I'm behind a screen and until proven, you're a Chroma dev
Anonymous No.106197061 [Report] >>106197070
>>106196832
Is something close to this even possible in wan 2.2? Does it knows what a skate grind is?
Anonymous No.106197065 [Report] >>106197103
>>106196765
Model?
Anonymous No.106197066 [Report]
>106197059
mindbroken
Anonymous No.106197068 [Report]
>>106197052
It's a finetune, it is supposed to do that. It's like saying Pony/Illustrious changes things too much from SDXL. Well, duh?
Anonymous No.106197070 [Report]
>>106197061
Wan 2.2 is not even Veo 2 tier.
Anonymous No.106197072 [Report]
>>106196906
Yeah I just got done testing them. Absolute garbage. Wan low noise v2v unironically better.
Anonymous No.106197087 [Report]
go let your chatbot do something else for a bit anon
Anonymous No.106197094 [Report]
>>106196765
wan2.2
Anonymous No.106197103 [Report]
>>106197065
meant to reply to this, wan2.2 t2v
Anonymous No.106197179 [Report] >>106197181
>>106196792
Hailou 02 test
Anonymous No.106197181 [Report]
>>106197179
Wan 2.2 I2V
Anonymous No.106197182 [Report]
any way to get Wan2.2 to stop generating a png along with my mp4? i'm using kijai's workflow from the rentry guide
Anonymous No.106197186 [Report] >>106197192 >>106197202 >>106197211
>skip a few threads
>seeing lightx2v lora update
>come back
>every clip same shitty slow-mo
When it ends bros?
Anonymous No.106197192 [Report] >>106197202
>>106197186
chinks make great foundational models. they make the shittiest optimizations however
Anonymous No.106197200 [Report] >>106197221
>chroma-unlocked-v50.safetensors
is this the final version?
Anonymous No.106197202 [Report] >>106197218
>>106197192
>>106197186
works on my machine
Anonymous No.106197211 [Report]
>>106197186
just make it run at a higher fps lol
Anonymous No.106197218 [Report]
>>106197202
we know it works but it looks like shit
Anonymous No.106197221 [Report] >>106197250
>>106197200
unfortunately yes
Anonymous No.106197248 [Report] >>106197279
https://huggingface.co/lodestones/Chroma/discussions/100
>struggling with v50
joever
https://huggingface.co/lodestones/Chroma/discussions/99
>yeah v50 is the final "base" model, speed model will come after
Anonymous No.106197250 [Report]
>>106197221
pls... just two more epochs
Anonymous No.106197251 [Report] >>106197265
Anonymous No.106197265 [Report] >>106197288
>>106197251
what processing did you do on this anon? Looks great, good length too. what was original output and what was it frame interp'd to?
Anonymous No.106197275 [Report] >>106197323
where is the chroma v50 fp8 version?
Anonymous No.106197276 [Report]
>>106193870 (OP)
is there any LLM AI model that doesnt need a GPU to do this? i have a shitty PC with no gucci video card why cant i just make pics with the CPU?
Anonymous No.106197279 [Report]
>>106197248
>https://huggingface.co/lodestones/Chroma/discussions/100
>literal who promptlet complaining
This thread gets enough of that already. Who cares.
Anonymous No.106197285 [Report] >>106197306 >>106197319
Man v50 really just released with just some general attention, the hype completely died out. Props for him actually fronting money but man, I really still feel model creators muck around too much with training and just keep going despite irredeemable mistakes because of sunk cost fallacy in the thousands of dollars.
Anonymous No.106197288 [Report] >>106197316
>>106197265
this is just wan 2.2 i2v, i took the last frame of the first gen, genned again and just concatenated the two videos
frame interpolation with FILM VFI node
Anonymous No.106197291 [Report] >>106197339
playing around with img2img with wan2.2 and llm prompt interrogation from original image
Anonymous No.106197306 [Report]
>>106197285
If it hadn't released around the same time as Wan 2.2 or Qwen then it would've gotten more attention.
Anonymous No.106197309 [Report]
where is the chroma v50 nunchaku version
Anonymous No.106197310 [Report]
>>106195231
unless the dataset is full of synthsloppa, there is no way the image on the right is its distribution
Anonymous No.106197311 [Report] >>106197325
Is 2x24GB enough to train a Chroma lora?
Anonymous No.106197316 [Report] >>106197346
>>106197288
wow, couldnt even see the seam, when ive tried prior it would always have slightly different camera movement which would make it a blatant splice
Anonymous No.106197319 [Report]
>>106197285
>body horror anatomy still present with more than 1 subject or non-close up shots
>v50 somehow more slopped with plastic skin and ultra bokeh
i don't know what i expected
Anonymous No.106197323 [Report] >>106197332
>>106197275
https://huggingface.co/Clybius/Chroma-fp8-scaled/blob/main/v50/chroma-unlocked-v50_float8_e4m3fn_learned_svd.safetensors
Anonymous No.106197325 [Report]
>>106197311
A single 24gb gpu is enough
Anonymous No.106197332 [Report]
>>106197323
svd?? qUANT?!?!?!
Anonymous No.106197339 [Report] >>106197416 >>106197442 >>106197460
>>106197291
original always on left, right is wan upscale
> 0.45 denoise, anime tokens taken out of prompt
Anonymous No.106197345 [Report]
Maybe I'm missing something since other people keep swearing that v50 is perfect, but the only way I can get Chroma to be functional and not give me extreme body horror or artifacts is by using the Flash models/loras. But then you don't have a negative prompt so the fuck's the point?
Anonymous No.106197346 [Report] >>106197359
>>106197316
yeah the seam is there but hard to spot if you're not looking for it. i think adding (shaky selfie cam:1.2) to the prompt helped in this instance
but otherwise it's a bog-standard wan gen, with the 2.2 lightning loras at 1 strength and 3+3 steps
Anonymous No.106197354 [Report] >>106197365
does chroma work with flux lora and flux redux? these are the reason why flux ecosystem is so good
Anonymous No.106197358 [Report] >>106197367
Hailou 02 test
Anonymous No.106197359 [Report]
>>106197346
preciate the info anon, thanks
Anonymous No.106197365 [Report]
>>106197354
I've seen people say they were able to get both working, yeah. I think it kind of depends on the lora
Anonymous No.106197366 [Report]
What should I type into the positive prompt if I want to gen images like this one?
Anonymous No.106197367 [Report] >>106197380 >>106197447
>>106197358
Wan 2.2 I2V
Anonymous No.106197375 [Report] >>106197387
Is the Chroma 1-HD just renamed V50?
Anonymous No.106197380 [Report] >>106197453
>>106197367
they really fucked up with the constant talking thing, it ruins a lot of videos
Anonymous No.106197387 [Report]
>>106197375
yes
Anonymous No.106197405 [Report]
Anonymous No.106197416 [Report] >>106197442
>>106197339
is left a gen?
Anonymous No.106197442 [Report]
>>106197339


>>106197416
screencap from Evangelion
Anonymous No.106197447 [Report]
>>106197367
did you overcompress this or is that the normal output?
Anonymous No.106197453 [Report] >>106197569
>>106197380
strange. no one talks in my realistic gens. i'll try anime
Anonymous No.106197460 [Report] >>106197472
>>106197339
why even try this? there are far better image upscalers. This seems like an exercise in retardation.
Anonymous No.106197472 [Report] >>106197488 >>106197505 >>106197518
>>106197460
not an upscale, its just 1:1 img2img with wan and basic denoising, just curious to see how it handles it
Anonymous No.106197488 [Report]
>>106197472
ignore the retard anon keep going
Anonymous No.106197505 [Report]
>>106197472
going to try for prompt only style changes now, probably realism <-> anime to start
Anonymous No.106197518 [Report]
>>106197472
Boo, nta, but I thought that was prompt interrogation.
Anonymous No.106197529 [Report]
>>106197528
>>106197528
>>106197528
Anonymous No.106197569 [Report]
>>106197453
yes only happens in anime or cartoon
Anonymous No.106197867 [Report]
>>106194386
I want to know more