← Home ← Back to /g/

Thread 106293782

333 posts 196 images /g/
Anonymous No.106293782
/ldg/ - Local Diffusion General
Will Deal With You One By One Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106288550

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.1: https://rentry.org/wan21kjguide
2.2: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-HD/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106293807
blurry thread of chroma artifacts
Anonymous No.106293814
cursed thread of baked favoritism
Anonymous No.106293820 >>106293825 >>106293849 >>106293854
Anonymous No.106293821
Blessed thread of frenship
Anonymous No.106293825 >>106293844
>>106293820
o ma gawd catbox please
Anonymous No.106293828
Anonymous No.106293837 >>106293957 >>106294139
>>106293468
same prompt in chroma with no LORA... had to pick best out of 10 because chroma still has anatomy and hand issues. v49 had the best results, 48d gave absolutely trash backgrounds and couldn't handle a higher res either
Anonymous No.106293844 >>106293862
>>106293825
petitgoth
Anonymous No.106293849
>>106293820
zamn that skin tone is top notch
Anonymous No.106293854
>>106293820
Sauce me, boss. Appreciate you.
Anonymous No.106293862
>>106293844
why are you posting this in MY /ldg/?
Anonymous No.106293864 >>106293870
thats just some OF thot
Anonymous No.106293870 >>106293879
>>106293864
sorry, nothing against your thread you were just the latest
Anonymous No.106293874
what is wrong with you? you fucking NIGGER. you fucking WELL POISONING JEW.
Anonymous No.106293879
even worse, reddit whore
>>106293870
but whats the point of posting that slut in latest thread?
Anonymous No.106293888
I will now fap.
Anonymous No.106293896 >>106293915 >>106294060
theres nothing fappable here tho
https://coomer.st/onlyfans/user/amberpetite
Anonymous No.106293908
>15 posts (not including op)
>1 AI-generated image
Anonymous No.106293915
>>106293896
Snoozefest
Anonymous No.106293932
people pay for this..? man..
Anonymous No.106293957
>>106293837
v48 for comparison, look at those fucked up buildings lol
Anonymous No.106293962 >>106294752
I am training a controlnet using a dataset of 1024x1024 gens I got from 4chan generals, and then I made them into 64x64 sprites then upscaled into 1024x1024 blocky shit.

Hopefully I can create a controlnet that can do pix2pix from pixel art into HD.
Anonymous No.106293968 >>106293996 >>106295007
***OFICIAL CHROMA COPE LIST FROM LAST THREAD***

Know your Chromacoppers!:

"It's a good starter model" (>106291752)

"You're just a hater" (>106291808)

"Best for NSFW" (>106291947)

"Better than everything else(SD1.5/SDXL/Flux)" (>106291978)

"Critics are trolls" (>106292229)

"Popular doesn't mean good" (>106292891)

"Users are just incompetent" (>106292990)

"Skill issue" (>106293062)

"Flawed doesn't mean broken" (>106293313)

"Show your work" (>106293333)

"You're exaggerating" (>106293523)

"It's getting better" (>106293635)
Anonymous No.106293993
Wish I had a giganigga super computer
This is one of the threads of all time
Anonymous No.106293996
>>106293968
>Best for NSFW
This one is true if Illustrious isn't being considered
Anonymous No.106294000
nice chromacache. the node sucks ass
Anonymous No.106294010
Anonymous No.106294016
wow that isn't mental illness at all>106293968
Anonymous No.106294018 >>106294120 >>106294149
the polygon anime girl turns around and walks down the aisle of the grocery store.

wan 2.2 is so good.
Anonymous No.106294036 >>106294773
Anonymous No.106294060
>>106293896
she would unironically be cuter with a nice little feminine penis tbqhwy
Anonymous No.106294120
>>106294018
I want her to beat me with that leek in HDR
if you know what I mean
Anonymous No.106294139 >>106294191 >>106294344
>>106293837
>Being this much of a promptlet

Pic rel is my first try anon. And before you say the background is fucked up look up what a real favela looks like anon.

https://files.catbox.moe/99dign.png
Anonymous No.106294149
>>106294018
Impressive how wan kept the shading on a single cell after she showed her back
Anonymous No.106294191
>>106294139
>that AK47
Anonymous No.106294232 >>106294244 >>106294248 >>106294607 >>106294891
Help me /ldg/ which Chroma version is best for a footfag?
Anonymous No.106294244
>>106294232
Chromatoes.
Anonymous No.106294248
>>106294232
v48 or v49
Anonymous No.106294276 >>106294307 >>106294747
Combining Natural Language Prompts and *booru style tagging is in direct violation of God's will.
Anonymous No.106294306 >>106294782
>Set up wan 2.2
>only have 12 gb vram
>using full fp8
>just works
>5x faster than 2.1 was
I love the chinks
Anonymous No.106294307
>>106294276
I think that's the best, prose is bloated for some things, and the tags for other things are incompetent
Anonymous No.106294319 >>106294384
Anonymous No.106294344 >>106294607
>>106294139
I was comparing a prompt between qwen and chroma though. so doing additional prompt engineering would defeat the purpose.

>before you say the background is fucked up look up what a real favela looks like anon.
your favela looks OK, though qwen's favela rendition is superior to any chroma favela ITT. it was v48d that had a truly fucked up one. I used to be partial to 48d but imo 49 is the least shitty chroma version rn

also
>LLM-generated prompt
cringe
Anonymous No.106294371
Anonymous No.106294384 >>106295334
>>106294319
nice, workflow?
Anonymous No.106294400
It seems img2img can do just fine low res pixel art to HD.
Anonymous No.106294412
Top is with chroma cache set to 1. Bottom is no cache. Any higher number is even worse
Anonymous No.106294513
the polygon anime girl throws the green leek vegetable at the camera.

nice magic trick, miku
Anonymous No.106294607 >>106294677
>>106294232
Any version is good. Pic rel is v50 (1152x1152) or you can go down to any version and still get good results.

>>106294344
You're implying a trash result is a Chroma flaw though, when in reality it's either a bad prompt or bad settings. I used LLM because I have found it can help against artifacts. I can tone down my prompt (not use the LLM) and still get results better than what you've posted so you're doing something wrong. Those results look like the VAE or sampler settings are wrong.
Anonymous No.106294677 >>106294892
>>106294607
>You're implying a trash result is a Chroma flaw though
lmao it is though... that's exactly what it is. Same prompt, Qwen gets it right and Chroma poops itself. I bet Wan does it well too.

Yes you can engineer prompts to avoid Chroma's flaws. I've been genning with Chroma since v22. but it simply is not as capable as qwen or wan at basic anatomy and architecture. its strong points are styles and nsfw. hopefully the new versions the furry is cooking will fix its problems.

>I used LLM because I have found it can help against artifacts
you do you. I don't like prompt bloat, and I've had good results from chroma without it.
Anonymous No.106294682 >>106294703 >>106295216
>Allocation on device
>This error means you ran out of memory on your GPU.

I just genned 5 videos with the same setting you piece of shit
Anonymous No.106294703
>>106294682
comfy has some fuckin memory leaks i also have to reset every 5-10 gens, 64gb ram, 24gb vram
Anonymous No.106294714
Anonymous No.106294720 >>106294736
Guys please educate me on SD 1.5
I'm getting into lora training and realized its much faster on 1.5 compared to SDXL. So I want to practice and learn its quirks on it first before going back to sdxl with some decent datasets.

I'm used to sdxl checkpoints, how is the scene nowadays for 1.5? Anything that can do hardcore NSFW reliably like lustify on sdxl? I'm having moderate sucess on realvision but doing only solo posing and stuff gets old fast.
Anonymous No.106294722 >>106294748 >>106294781
why are people shilling qwen so hard if it's from alibaba? isn't it corporate censored crap? why even compare it to chroma that is trained on porn
Anonymous No.106294730 >>106294803
Any news on long video gens yet? Saw some extended wan workflows but they're all nest sub graph shit, dont really feel like taking it apart to add the all in one wan 2.2 model
Anonymous No.106294736 >>106294758 >>106294880
>>106294720
>how is the scene nowadays for 1.5
It's extremely outdated and nobody really makes anything for it but it's fun to go back to sometimes
>Anything that can do hardcore NSFW reliably like lustify on sdxl
No, SD1.5 just can't do that stuff like later models can.
Anonymous No.106294747 >>106294762 >>106294899
>>106294276
I wonder how good it would be if it were trained ONLY on natural language. joycaption is so good sometimes

also does anyone know how much money it would have cost to train chroma up until now?
Anonymous No.106294748
>>106294722
It has a good license and has good prompt adherence. It's censored in the sense that they didn't train on NSFW stuff but there's nothing stopping anyone from training it on NSFW themselves other than the fact that the model is massive and it would cost a small fortune.
Anonymous No.106294752
>>106293962
Can't help since controlnets are outside of my training experience, but I'm rooting for you anon!
Anonymous No.106294758 >>106294776
>>106294736
>It's extremely outdated and nobody really makes anything for it but it's fun to go back to sometimes
I heard the lora scene is way more mature over there. I thought I could at least get some testbed setup going to cover these kind of gens. Seems like I got rused then.
Anonymous No.106294762 >>106294874
>>106294747
lodestone said it's cost him $150k by now. Probably a little more since that was before he started retraining.
Anonymous No.106294773
>>106294036
So many unextected events in this video, lollypop magic, exciting paper craft
Anonymous No.106294776 >>106294808 >>106294823
>>106294758
The LoRA scene is more mature for SD1.5 since it was a) smaller and cost less/was easier to train on, and b) is much older so much more exists for it by virtue of the fact that it was around forever. Nobody makes anything for it anymore, though.
Anonymous No.106294781
>>106294722
it's the most advanced and capable local model, by far. and has a good license. yeah it's censored though and has some weird blind spots.
Anonymous No.106294782
>>106294306
Faster with the lightx2v lora which existed for 2.1.
Anonymous No.106294795 >>106294799 >>106294818 >>106294843
does qwen mog flux?
Anonymous No.106294799 >>106296180
>>106294795
Almost anything mogs flux. SD 1.5 mogs flux.
Anonymous No.106294803 >>106295113
>>106294730
Why the frick are you using the all in one?
Anonymous No.106294808
>>106294776
I miss being able to finetune like a boss on SD1.5

My 3090 handles training Flux / Chroma loras fine, but then there's Wan and Qwen, and suddenly a 3090 feels really slow
Anonymous No.106294818
>>106294795
Flux isn't even close
Anonymous No.106294823 >>106294868
>>106294776
>Nobody makes anything for it anymore, though.
new loras are added and finetunes are added to civitai pretty regularly actually. but i suppose maybe you meant "nobody makes anything GOOD for it anymore", which fair probably right. fun to load it up every now and then and just rapidfire a bunch of images off for the lulz.
Anonymous No.106294824 >>106294916 >>106295145
auraflow chads our time will come
Anonymous No.106294830
is there any table with how long it takes to train wan 2.2 lora based on how many input videos
I have 3090 ti
Anonymous No.106294842
anime style Miku Hatsune on the cover of a Nintendo Switch videogame in pixel art style. On her arm is the text "01" in red text.

actually got most of the logos right
Anonymous No.106294843
>>106294795
yeah, base flux is a joke in comparison, in every way.
Anonymous No.106294861 >>106294897
I will try qwen
rentry on how to do qwen local pls?
Anonymous No.106294868
>>106294823
Yep its blazing fast on my 4090, need to watch when genning stuff on a loop to not suddenly have 1000 new files lol. Could be interesting to test some mass processing scripting.
Which checkpoints + lora you recommend to extract the max out of it?
Anonymous No.106294874
>>106294762
>lodestone said it's cost him $150k by now.
Now I get it.
Anonymous No.106294880
>>106294736
>No, SD1.5 just can't do that stuff like later models can.
ahh the good ol days of models that absolutely cannot do anything but a 7 fingered 1girl, standing
Anonymous No.106294885 >>106294897
anime girl Miku Hatsune on the cover of a Nintendo Switch videogame called "Miku Hatsune project Qwen" in pixel art style. She is singing at a concert, with a spotlight shining down on her. On her arm is the text "01" in red text.

qwen q8 distilled + 8 step lightx2 lora, at 8 steps.
Anonymous No.106294890 >>106294904
What drives a man to spend $150k on an AI model?
Anonymous No.106294891
>>106294232
v50 bf16 with height and width +128px your regular one
Anonymous No.106294892 >>106294918 >>106294922 >>106294965 >>106294982
>>106294677
Anon, here's a very basic type of image that Qwen can not do. When you are overtrained on polished or clean images, you cease your ability to create images like this. That is enough for me to call it trash.

>Amateur photograph of Japanese maid cosplay sitting a Tokyo Metro train, she is wearing stockings and her panties are slightly visible, she is asleep, creepshot
blurry 240p home video. Youtube.com video player UI.

>Qwen gets it right, and Chroma poops itself

This is what a favela looks like
https://gdb.voanews.com/cc7b4408-b3a2-4b86-b819-0d8423c220c7_cx0_cy10_cw0_w1080_h608_s.jpg
https://upload.wikimedia.org/wikipedia/commons/7/7e/1_rocinha_favela_closeup.JPG

This is the 2nd Chroma gen for your retarded prompt anon. While not flawless, it's still more realistic than the Qwen gen. You're just arguing in bad faith, because Chroma can give you exactly what you want to depict.

https://files.catbox.moe/y6pj2w.png

I'm sorry anon, I don't have cure for retardation. Here's one last shot at the retarded gen, no fucked up limbs, on my literal 3rd try, and no fucked up limbs because with my settings and prompt I haven't had that happen once
https://files.catbox.moe/a55ph7.png
Anonymous No.106294897 >>106294995 >>106295060
>>106294861
use the template from comfy

get q8 distilled or regular

get the lightx2v lora (4 or 8 steps)

gen

>>106294885
also fixed gen here:
Anonymous No.106294899 >>106295044
>>106294747
>I wonder how good it would be if it were trained ONLY on natural language
Look at qwen for that, you have shit seed diversity when doing anything except paragraph of prose then
Anonymous No.106294904 >>106294942 >>106294959
>>106294890
He's a rich furry with money to blow on expensive AI projects. I'm not complaining as long as we get more models to use.
Anonymous No.106294916
>>106294824
Aurapooflow never got off the ground, when the (then) best tuner furfag in the business couldnt make it better than even basic flux, you know its fucking over.
Anonymous No.106294918 >>106294960
>>106294892
>qwen can't gen shiity crusty deepfried 144p youtube screenshots
for some reason i'm not mourning this loss
Anonymous No.106294922 >>106294965 >>106294975 >>106294982
>>106294892
But go on anon, I bet you will continue lying about how Qwen is superior even though Chroma can (without surprise) pull of any kind of NSFW prompt you throw at it effortlessly.
Anonymous No.106294939 >>106294997 >>106295052 >>106295135
In case someone wants play with the Dall-E Pixar style test lora (and to make the Chroma hater seethe some more)

Use 'pixar style' as the trigger, it's not a very good set, only 60 images and 1:1 cropped by my hacky autocropper and trained at 512 resolution, but it does surprisingly well all things considered

https://files.catbox.moe/hahh7k.7z
Anonymous No.106294942 >>106294961 >>106294964
>>106294904
Did he literally say verbatim that he's rich? Sometimes, ordinary people are the most naive ones and are the ones who do this kind of crazy money spending.
I doubt that a millionaire who became a millionaire through his own efforts would even consider spending that amount of money on that.
Anonymous No.106294959
>>106294904
Is he though ? I'm thinking it's more like the furry community is loaded with cash for some reason

The trannies are poor, the furries are rich, weird how things work out.
Anonymous No.106294960 >>106294975
>>106294918
It can't gen depict the prompt period, whether you are asking for a grainy image or not, while Chroma can do both, in any situation and given any context.
Anonymous No.106294961
>>106294942
I'm just assuming since it's a common theme with furries. You'd think someone who's rich wouldn't bother but for some reason a ton of rich techfags are furries and will blow shittons of money on commissions, fursuits, etc. I'm just guessing he's the same but blows money on AI training stuff instead.
Anonymous No.106294964
>>106294942
some people are randomly rich / lucky / fortunate
Anonymous No.106294965
>>106294922
>>106294892
autism
Anonymous No.106294969 >>106295097
Trying to get back in comfyui.

Is there a way to solve this?
Anonymous No.106294975
>>106294922
>>106294960
fucking lol
Anonymous No.106294982 >>106295007 >>106295455
>>106294892
>>106294922
trukeism

chroma isn't perfect but the fact it works with natural captions and is so open to goon material makes it the best. I don't do this stuff to generate trees and boring crap like that
Anonymous No.106294995 >>106295022 >>106295072
>>106294897
I have been away for awhile, wtf is qwen? back in my day it was all about flux and sdxl...
Anonymous No.106294997 >>106295007 >>106295032 >>106295056 >>106295223
>>106294939
>and to make the Chroma hater seethe some more
Really?
No man, I don't “seethe” or “hate,” don't personify me. I just speak my mind, it flows naturally, no strings attached. That’s how I keep the discussion going long term.
And you're an idiot if you think about me and dedicate posts to me.
Concentrate on leveling up your product and yourself as a man.
Don't sweat what I do or say.
Anonymous No.106295007
>>106294982
nobody disagrees with this except for the one schizo
>>106293968 >>106294997
it's just funny to see the same guy rushing over and over to slavishly defend chroma's honor while posting some of the shittiest gens imaginable as proof that it's the best
Anonymous No.106295009 >>106295046 >>106296257
Can you use custom clipL/G on reforge? Can't find an option in the UI for that.
Anonymous No.106295010 >>106295029 >>106295046
why does my illustrious pictures come like this?
Anonymous No.106295022 >>106295041
>>106294995
20b model that got released recently, bigger than flux. it has an apache 2.0 license so everyone is ditching flux for qwen
Anonymous No.106295029 >>106295047
>>106295010
share workflow
share screenshot
share feetpics
Anonymous No.106295032 >>106295056 >>106295061 >>106295223
>>106294997
I was just poking fun at you, relax my man

Truth be told, it wouldn't be as fun here without guys like you, I miss Rocket already, I would miss you as well

But no, I'm not training Chroma models to spite you, nor do I release them to spite you, I do it because it's fun to train Chroma models
Anonymous No.106295041 >>106295059 >>106295060 >>106295119
>>106295022
flux used to be fairly slow on my 16gbvram card
is qwen going to be even slower?
I'm liking these mikus I wanna gen some too, but if it's even slower than flux then damn...
Anonymous No.106295044 >>106295616
>>106294899
fair enough

but I'd rather type 200 word essay and get EXACTLY what I want without errors than play gacha. it's not even hard when you have joycaption, and/or LLM to help you make longer prompts
Anonymous No.106295046 >>106295061 >>106295066
>>106295009
>>106295010
kill yourself
Anonymous No.106295047
>>106295029
https://files.catbox.moe/xsz16o.png
Anonymous No.106295052 >>106295095
>>106294939
fucking close! can you make some tests, on realistic dall-e style?
Anonymous No.106295056 >>106295074 >>106295223
>>106295032
>>106294997
just kiss already!
Anonymous No.106295059
>>106295041
>is qwen going to be even slower?
Yes
Anonymous No.106295060 >>106295143
>>106295041
i'm pretty sure there are ways to run it faster than flux. 16gb should be enough. see >>106294897 and look into nunchaku
Anonymous No.106295061
>>106295046
also meant for >>106295032
Anonymous No.106295066 >>106295087
>>106295046
So, can you use custom clip files on reforge or not? If yes, how?
Anonymous No.106295072 >>106295143
>>106294995
essentially flux but better and with better prompt comprehension. Despite the q8 model being 21gb and me having 16gb, I made the miku gen in 26 seconds with no cpu loading or whatever.
Anonymous No.106295074
>>106295056
kek
Anonymous No.106295087 >>106295107 >>106295114 >>106295140 >>106295182
Apart from generating text in images, which I can do by opening any image editor after generating images with Forge.
Is there any reason why you generate such generic or common images that could be perfectly generated with SDXL? So far, I haven't seen anything that says (apart from generating text) “uh, I can't do this with SDXL.”

>>106295066
No, stop dreaming, and you will use Clip in Comfy, and the result will be the same.
Anonymous No.106295095
>>106295052
You mean train a lora on 'realistic' Dall-E style ?

Can it even do that ? All gens I've seen are artsy or very slopped

There was some other guy who posted a Dall-E chroma lora a few threads back, it had much more styles than this, perhaps it had realistic stuff as well ?
Anonymous No.106295097
>>106294969
Just call it intentional, walla, fixed
Anonymous No.106295107
>>106295087
What I mean is that the Anon who shared the Nuke Nuke fan art in the last thread, featuring a woman hugging a man while others observe, created it using SDXL. That's far more intricate than the Mikus you all are making now.
Anonymous No.106295109 >>106295116 >>106295127 >>106295129 >>106295293 >>106295629
How do I fix this?
Anonymous No.106295113
>>106294803
Cause I'm 16gb vramlet and all in WAN doesnt load all at once
Anonymous No.106295114 >>106295136
>>106295087
>Is there any reason why you generate such generic or common images that could be perfectly generated with SDXL?
no. /ldg/ makes much more sense when you realize that the people here are arguing about which massive sota model you should use to generate 1girl, standing, masterpiece, best quality, HD, 8k, on a beach, which has been a solved problem for 1-2 years now
Anonymous No.106295116
>>106295109
it's called art, sweaty
Anonymous No.106295119
>>106295041
No, with lightning lora + nunchuku its faster than sdxl ever was
Anonymous No.106295125
Kino gens ITT
Anonymous No.106295127
>>106295109
i pmd you the fix after i read your mind to extract the workflow you were using
Anonymous No.106295129 >>106295142 >>106295293
>>106295109
Lower you CFG from 12071127 to something like 4 ?
Anonymous No.106295132
damn ive just finished reading the cope of the chroma shitter in the last thread, like what the fuck lmao, v49 and v50 are FUCKED, the v50 merge completely destroyed any quality chroma had, v48 is unfinished and this retard is lamenting that this model is not being picked up for nunchaku optimization because le evil reddit didnt like it! like get a reality check, retard, why would anyone spend resources on a model which is currently not in a final state (v50 is being retrained as we speak)? why do you defend chroma so ardently? fucking retard mentality I swear
Anonymous No.106295135 >>106295241
>>106294939
“Wow, I'm going to download this Lora from Disney Pixar and Dalle right now!!, and I'm going to generate images all day using them!”
No one has ever said that in their entire fucking life.
Anonymous No.106295136
>>106295114
And one guy literally turbo autism over the same stupid blue haired anime girl always “testing new models” or some shit Basically 4chan same as it always has been
Anonymous No.106295140 >>106295151
>>106295087
You could use custom text encoders in Forge but unfortunately support for forge is tenuous at best while Panchovix is back maintaining and updating reforge. So I guess he hasn't implemented the option for custom text encoders yet?
Anonymous No.106295141
>actually attempting to assist the retard NEET
Anonymous No.106295142
>>106295129
CFG is 2.
Anonymous No.106295143
>>106295060
>>106295072
even faster then flux? impossible
I will look into this
Anonymous No.106295145
>>106294824
Trust ponyanons plan, soon we will be eating good
Anonymous No.106295151 >>106295164
>>106295140
its all the same, with text encode or without it, you will have the same probability of melted hands, ugly feets, and non existant pupils
Anonymous No.106295155 >>106295161
wow v48 really does mog v50
thanks
Anonymous No.106295161 >>106295178
>>106295155
yea it looks like he fucked up some settings with the faster training for those
Anonymous No.106295164 >>106295190
>>106295151
>blablabla
I want to know if you can use custom clipG/L in reforge currently you fairy, so is this a yes or no?
Anonymous No.106295171 >>106295317 >>106295332 >>106296062
Five years later and AI still can't be used to make HD videogame anime sprites for videogames.

Took me less time to actually learn to make them in blender than the time it will take for this scam and grift to be usable for videogames.

Because is a grift, like NFT 2.0.
Anonymous No.106295178
>>106295161
apparently lodestone said that it overtrained
Anonymous No.106295182 >>106295257
>>106295087
You are in the slopest of threads.
The chroma guy shared a fucking Disney Pixar Lora and a DALL-E, what woul you expect from that?
It's obviously that they aren’t artists, just boomers with time and a decent GPU, not looking for an artistic path.
Anonymous No.106295190 >>106295201
>>106295164
im the only one who is talking to you, KYS
Anonymous No.106295201 >>106295304
>>106295190
Can you use culstom clipL/G files in reforgeUI? Yes/No?
Anonymous No.106295207
why does fp8 quen give me random noise but distilled q8 works perfectly fine? even disabled the lora and it still does it.
Anonymous No.106295216
>>106294682
Anonymous No.106295223 >>106295414
>>106295056
>>106295032
>>106294997
Anonymous No.106295241
>>106295135
So salty
Anonymous No.106295257
>>106295182
you just aren't enlightened, anon. think about it: chroma won't randomly generate slop instead of a photo if you use a lora to generate slop 100% of the time
Anonymous No.106295266 >>106295274 >>106295512
Does qwen do vhs still, vhs footage good out of the box?

Chroma lora:
>>106158465
Anonymous No.106295274
>>106295266
qwen is like regular flux, very locked in to a few styles
Anonymous No.106295288 >>106295350
make one sharp, non fried chroma human
Anonymous No.106295293
>>106295109
>>106295129
change to euler simple
Anonymous No.106295296
wan anons who don't use lightx, how many steps are you going for? 25? 30? 40?
Anonymous No.106295304 >>106295320
>>106295201
KYS
Anonymous No.106295310 >>106295326 >>106295327 >>106295364
https://huggingface.co/lodestones/chroma-debug-development-only/tree/main/HD
so when will we finally get the actual complete version
Anonymous No.106295317
>>106295171
You're retarded but also based.

I also wish there would be base models trained for this purpose, but there aren't because it's a niche use, and training loras for the purpose gets you closer but not enough.

I've been helping a buddy train on his graphics for his indie games, as a concept / idea factory it is great, but it's really hard to get it close to the finalized graphics stage, which is what we've been trying for a while.

He fully expects having to redraw them of course, but you want the characters in poses to be as close to finalized as possible, so that it's very quick to just clean them up.
Anonymous No.106295320 >>106295357
>>106295304
Can you use custom clipL/G files in reforgeUI? Yes/No?
Anonymous No.106295326
>>106295310
when it's done i guess. he's also working on something called reflow and something called radiance
Anonymous No.106295327
>>106295310
when you level up your product and yourself as a man.
Anonymous No.106295328 >>106295335 >>106295337 >>106295341 >>106295343 >>106295349 >>106295351
how much longer a lifespan does illustrious have.... six months? more?
Anonymous No.106295332
>>106295171
>five years of progress he was here for and still is this low iq
quite a grim realization that most "people" in this world are dumber than a 1b llm
Anonymous No.106295334
>>106294384
>nice, workflow?
thanks.
boom prompt with flux-dev :
Being and Time , suffused by a sensibility derived from secularized Protestantism, The human condition is portrayed as "essentially a curse. with emotionally laden concepts guilt, conscience, angst and death.
stark minimalism
digital medium, surrealism, minimalist, modern, , , high contrast, sleek, futuristic, sleek lines, floating objects, , glowing accents, clean lines, abstract design, asymmetrical composition, high resolution
shapes and shadows. a sense of depth and dimension. surreal,
geometric abstract art, , high contrast, digital medium, overlapping circles and rectangles, intricate grid patterns, black ink splatters, minimalist, modern, angular shapes, central black circle with smaller circles, white and grey tones, clean lines, abstract composition, artistic, contemporary design, no people, no text, abstract background, visual complexity, black and white, geometric shapes, modern art, abstract style
by Ivan Chermayeff, minimalist,
a sense of depth and dimension.
Constructivism,
Suprematism,
exoplanet landscape, flat horizon
Steps: 31, Sampler: Euler, Schedule type: Simple, CFG scale: 1, Distilled CFG Scale: 3.5, Seed: 2352761364, Size: 1472x896, Model hash: 4610115bb0, Model: flux1-dev, Version: f2.0.1v1.10.1-previous-669-gdfdcbab6, Module 1: ae, Module 2: clip_l, Module 3: t5xxl_fp16
Anonymous No.106295335
>>106295328
forever
Anonymous No.106295337 >>106295595
>>106295328
sdxl in general will probably still be the most widely used model for a long while due to the fact that most people are vramlets who won't wait 1-2 minutes to gen images
Anonymous No.106295341 >>106295365
>>106295328
gonna goon to it until the retirement home
Anonymous No.106295343
>>106295328
exactly two more weeks
Anonymous No.106295349
>>106295328
Isn't it the model where there's the largest amount of loras released per day on Civitai, like it's not even close ?

Never used it, but it sure seems popular.
Anonymous No.106295350 >>106295385
>>106295288
https://files.catbox.moe/6m3n3z.png
Anonymous No.106295351
>>106295328
neta lumina already killed it anon
Anonymous No.106295357 >>106295373
>>106295320

in reforge you can use lora ctrl its a great tool to use different loras in differents steps
Anonymous No.106295364 >>106295383
>>106295310
can anyone confirm if these are actually better than v48 & v50? what metrics are you using to definitively prove it's better?
Anonymous No.106295365
>>106295341
So a couple more days then, grandpa

Bazinga!
Anonymous No.106295367 >>106295413
Anonymous No.106295370 >>106295376 >>106295381 >>106295386 >>106295401 >>106296626
hehe
Anonymous No.106295372 >>106295403
Anyone has the latest nvidia drivers? Anything broken?
Anonymous No.106295373 >>106295392 >>106296285
>>106295357
Can you use custom clipL/G files in reforgeUI? Yes/No?
Anonymous No.106295376
>>106295370
computer, now regenerate the same workflow except change the bat with a BWC
Anonymous No.106295381
>>106295370
Damn, she's a tank.
Anonymous No.106295383
>>106295364
some anons said they saw improvements. i would just wait until it's out if i were you
Anonymous No.106295385
>>106295350
what the hell is that workflow
Anonymous No.106295386
>>106295370
based
Anonymous No.106295392 >>106295397
>>106295373
Are you Forge Anon, how did it work everything?
Anonymous No.106295397
>>106295392
Can you use custom clipL/G files in reforgeUI? Yes/No?
Anonymous No.106295401
>>106295370
kek, didn't see that one coming
Anonymous No.106295403
>>106295372
>Anything broken?
nah they chill, king fr
Anonymous No.106295413 >>106295913
>>106295367
Diablo cinematic feel
Anonymous No.106295414
>>106295223

Best post since the sticky MP4.
Anonymous No.106295428
Anonymous No.106295433
I really cant believe we still dont have a basic solution to keep video quality over infinite time by just having some kind of an anchor image every generation is compared to and pushed towards
Anonymous No.106295455 >>106295555
>>106294982
Not just for goon stuff, but also a large subset of aesthetic semi goon images are just not possible with Qwen. The model may get a lot of flak from haters, but it's here to stay.
Anonymous No.106295467 >>106295477 >>106295489
Anonymous No.106295477 >>106295489 >>106295503
>>106295467
Anonymous No.106295486 >>106295494
Anonymous No.106295489
>>106295467
>>106295477
stalker
Anonymous No.106295494
>>106295486
braaaaap~
Anonymous No.106295497 >>106295517 >>106296311
I don't get inpainting. How do you get sharp results from the inpaint? Everything I inpaint feels a bit blurry or smudgy. Using forge and genning illustrious slop. Like let's say I manage to fix a hand's pose/finger count but the result isn't as high quality as the rest of the image. How do I refine the inpaint to match the detail level?
Anonymous No.106295503 >>106295524
>>106295477
Anonymous No.106295512
>>106295266
I feel like I’ve seen her in my dreams
Anonymous No.106295517 >>106295794
>>106295497
soft inpainting
Anonymous No.106295523
Mmmm, yummers
Anonymous No.106295524
>>106295503
nicer juice:
Anonymous No.106295540 >>106295570
Anonymous No.106295555
>>106295455
those feet are just so gross, I mean, no offense, but there’s no way I’m buying chroma with these feet pics!
Anonymous No.106295570 >>106295627
>>106295540
Not those feet pics again! They’re terrible, like mangled toes, I might actually vomit! Stop anon! this isn't funny!
Anonymous No.106295595
>>106295337
>wont wait 1-2 minutes
It takes me 12 seconds for a 1920x1080 and that feels like it’s too long already. But I’m on a 5070ti already I can’t afford more kek
Anonymous No.106295608
>but wait, didn't...
>but wait, wasn't...
>but wait...
Is this common for MoE thinking models or unrelated? Anyway, GLM4.5V with reasoning fucking sucks dick. I created my own vision benchmark on my business docs and it failed horribly in every way possible. Did the same benchmark with Gemini Pro 2.5 + Thinking and it aced it, even making me realize that one of the test answers in the benchmark was wrong because I missed critical information in one of the rows. Maybe not fair to compare a 108B model against 800B or whatever Gemini2.5Pro is, but since this is one of the top opensauce VLMs right now, why even fucking bother with local. seriously.
Anonymous No.106295616
>>106295044
>i need 200 words, I'm cultured
a few sentences with tits in all variations is enough for me.
Anonymous No.106295627
>>106295570
>anti-feetpic troon is back
Statler/Waldorf No.106295629
>>106295109
fix what? its the best post in \ldg\ in a LONG time!
BEAHGAHAHAH
Anonymous No.106295648
Anonymous No.106295654
Give 'em a foot, they'll make a rile.
Anonymous No.106295662 >>106295692 >>106295705 >>106295845 >>106295979
I want to generate plaphogs what’s the best model for that. All I see you guys ever make is skinny white girls and bugwomen
Anonymous No.106295692
>>106295662
personally, I like hyper bimbos
Anonymous No.106295705 >>106295720
>>106295662
I'm pretty sure they can all do it, you just need the right prompt keywords.
Anonymous No.106295720 >>106296382
>>106295705
You would assume they all can but I wonder if there were enough fuckable fat women in the data instead of just popular Instagram models. Unless I fundamentally misunderstand how this all works
Anonymous No.106295748
Anonymous No.106295760
Anonymous No.106295794
>>106295517
I tried it with this guide https://stable-diffusion-art.com/soft-inpainting/ but the results were an even blurrier mess.
Anonymous No.106295802
anime style Miku Hatsune on a building sized billboard advertisement for fruit juice in Akihabara, Tokyo. Miku is holding a green leek vegetable and wearing a shirt and skirt made of leek vegetables. On her arm is the text "01" in red text. full body view.
Anonymous No.106295805 >>106295829 >>106295850 >>106295956 >>106296686
chroma fares better in this comparison.
Anonymous No.106295809
Finally. Youtube psytrance thumbnail.
Anonymous No.106295829 >>106295850 >>106295897 >>106295956 >>106296035
>>106295805
qwen with 8step lightning lora at 8 steps, 1 cfg, + ultrareal lora.
the background is much more coherent and detailed. the girl I'd say is about equal.
Anonymous No.106295835 >>106295893
Anonymous No.106295841 >>106295880
What cfg settings do you guys use for wan2.2 with lightx2v? Seems the default is only 2 cfg for the high noise and 1 cfg for the rest?
Anonymous No.106295845 >>106295882 >>106295902 >>106295983
>>106295662
>“This is Katia. She’s on my team. I’d advise not making her angry in meetings. Her spreadsheets trap the souls of those who ignore deadlines.”
Anonymous No.106295850 >>106295897
>>106295805
>>106295829
not the chroma schizo, but qwen is looking too sharp/saturated, chroma is giving a more natural feel
Anonymous No.106295865 >>106295897
>try qwen_image_fp8_e4m3fn.safetensors
>get this
why does distil work fine but regular doesnt? comfy is updated
Anonymous No.106295880
>>106295841
I use cfg 1 for both, but I run the wan2.2 lightning lora and the 2.1 lightx2v for both the high and low noise models. For the 2.2 lora I run it at 1 strength, for the 2.1 I run it at 3 strength for 1 step and 2 strength for 1 step on the high noise model and at 1 strength for both steps on the low noise model. Overall doing 4 steps on 1cfg.
Anonymous No.106295882
>>106295845
Saw a bitch like that come pick up her husband at work the other day in yoga pants and a tight tee shirt. Popped a half chub which almost never happens at work. What’s wrong with me bros? Objectively gross but muhdick
Anonymous No.106295893
>>106295835
brilliant. yes.
Anonymous No.106295897 >>106295903 >>106295945 >>106295956
>>106295829
same, sans ultrareal lora. main difference here is way more artificial lighting, slightly worse texture on shoe and skin. background looking a bit more like a render.

>>106295850
a fair assessment, though some of this is due to the prompt:
underexposed outdoor scene, raw unedited amateurish candid photo.
a cute Japanese woman wearing a frilly pink maid dress, white pantyhose, shiny pink leather shoes, fake cat ears, long black hair, making a heart shape with her hands. she's standing on one leg, her other leg up behind her, knees together. she's smiling playfully, looking at viewer.
the background is the street entrance to a maid cafe in Akihabara district, Tokyo.
mid day, natural dramatic lighting, high contrast image. amateur quality, candid style

one could argue chroma is failing to adhere to some of the style prompt here.

>>106295865
you might need to turn off sageattention if you have that enabled
elf-hugger No.106295902
>>106295845
Terrifying, imagine her sharp nails digging into the base and stem while she tells you how to do your job.
Anonymous No.106295903
>>106295897
hm sage works fine with the gguf q8 distilled one, interesting
Anonymous No.106295913
>>106295413
*debo
Anonymous No.106295927
How snakeoily are the Dfloat11 models?
Anonymous No.106295945 >>106296072
>>106295897
with ultrareal lora, non lightning, 4 cfg, 20 steps. the lightning lora definitely biases towards a higher contrast look compared to this, pretty typical of low step loras. but I kind of prefer it in this case, and the gens take 1/4 of the time.
Anonymous No.106295956 >>106296009 >>106296024
>>106295805
>>106295829
>>106295897
the chroma output looks like artificial fill lighting on the maid, and the qwen outputs look like particularly good natural lighting based on the direction and contrast.

the background on chroma is complete trash and unusable. the qwen backgrounds are glitchy but could be photoshopped easily enough.

I think qwen (both examples) take this one.
Anonymous No.106295972 >>106296001
a closeup of a rectangular puzzle of Miku Hatsune on a wood table. On her arm is the text "01" in red text.
Anonymous No.106295979
>>106295662
Chroma is still your friend. It understands all kinds of words related to volume, from thicc to voluptuous
https://files.catbox.moe/ij3tym.png
https://files.catbox.moe/tiqe9p.png
https://files.catbox.moe/vdmvvw.png
Anonymous No.106295983
>>106295845
I can make chubsters plowsows and plaplaplaphogs all day in the anime models like illustrious and its mixes, but realism never seems to work right for me. But I’m an impatient vramlet that hasn’t stepped beyond sdxl based models lel. I tried running flux and it took like two minutes for one image, fuck that noise. Probably didn’t configure it right at all but still. I’ll try it once forge classic finally rolls it out. if I can’t download model and click “give me 1girl”, why bother?
Anonymous No.106295998
Excuse me, but I don't understand. You generate images and videos non-stop without any meaning, just making comparisons of which is better and which is worse. Don't you look for anything beyond that?
Anonymous No.106295999
On the topic of speeding up Chroma, this is looking pretty good:
https://github.com/silveroxides/ComfyUI_SigmoidOffsetScheduler
I'm getting decent results at as low as 15 steps.
Anonymous No.106296001
>>106295972
Anonymous No.106296009 >>106296044
>>106295956
>the background on chroma is complete trash and unusable
According to whom? That is a perfectly fine background, and you can modify the prompt to blur it just like Qwen.
Anonymous No.106296024 >>106296035 >>106296057 >>106296166
here's a more challenging example.

>>106295956
agreed on the lighting, though qwen really really likes to do artificial lighting on humans too. actually, chroma doesn't even give her a shadow in that specific one, though I guess that makes sense for diffuse lighting? qwen also isn't giving much of a shadow in this more complex example.
Anonymous No.106296035 >>106296103
>>106296024
>>106295829
Why does the ground look like cut styrofoam?
Anonymous No.106296044 >>106296445
>>106296009
The text and the graphics on the sandwich boards are badly melted. The Qwen text isn't totally sensible, but it's usable. You could edit it to clean up all the "AI tells" easily. The reflections on the glass in the Chroma image look suspect too.
Anonymous No.106296055 >>106296307
tried training an artist lora for qwen, was an utter waste of time.
250 images done at 3k steps, barely changes the style at all. Do I maybe need an activation tag?
Anonymous No.106296057
>>106296024
This looks like strong flash in-line with the camera so it's normal not to have a shadow. If the flash is offset more there would be a little high-contrast shadow visible on one side.
Anonymous No.106296062 >>106296875
>>106295171
Anonymous No.106296072 >>106296166
>>106295945
where is the qwen ultra real lora? this rendering is amazing
Anonymous No.106296094
Doesn't know what an EVA is.
Anonymous No.106296103
>>106296035
trowel and error
Anonymous No.106296166 >>106296263 >>106296330 >>106296343 >>106296501
>>106296024
and here's where chroma completely falls apart, at both 1MP res or higher. best of 6 too, the others far worse, while qwen gets it right every time. chroma simply can't handle a complex foreground and complex background like the one specified here, with complex building architecture and human crowds. MAYBE this could be fixed with the upcoming releases... to be fair, I'd be surprised if any model can handle this as well as qwen does. anyone want to test this on Flux Dev, HiDream, or SAAS models like MJ or 4o?
>underexposed outdoor scene, raw unedited amateurish candid photo.
>a cute Japanese woman wearing a frilly pink maid dress, white pantyhose, shiny pink leather shoes, fake cat ears, long black hair, making a heart shape with her hands. she's standing on one leg, her other leg up behind her, knees together. she's smiling playfully, looking at viewer. She's on a sidewalk.
>the background is the street entrance to an electronics store in Akihabara district, downtown Tokyo city. Other shop entrances, a busy intersection with hundreds of pedestrians crossing the street, and a giant LED billboard depicting a cartoon Chibi anime astronaut are visible in the distance.
>mid day, natural dramatic lighting. amateur quality, candid style

>>106296072
https://civitai.com/models/1662740/lenovo-ultrareal
this one is also worth trying:
https://civitai.com/models/1869530/qwen-imageemotional-photography
Anonymous No.106296175 >>106296204
Miku Hatsune putting a puzzle together on a table, the puzzle is a rectangular puzzle of Miku Hatsune on a wood table. On her arm is the text "01" in red text.
Anonymous No.106296180
>>106294799
This.
Anonymous No.106296204
>>106296175
better:

an anime style Miku Hatsune putting a puzzle together on a table, the puzzle is a rectangular puzzle of Miku Hatsune on a wood table. On her arm is the text "01" in red text.
Anonymous No.106296209 >>106296263
I can't keep lying to myself. Despite Qwen's obvious sloppiness I can't deny the outputs are visually more appealing that Chroma.
Anonymous No.106296227
Anonymous No.106296250
Anonymous No.106296257 >>106296266
>>106295009
get real dude
Anonymous No.106296263 >>106296330 >>106296343
>>106296166
>>106296209
i don't really blame chroma for this since qwen is a bigger model, from an established lab, and there was no way to know it was releasing, but qwen+style loras is better for everything but nsfw. if or when someone manages to make a proper nsfw finetune for qwen then i don't see anyone using chroma
Anonymous No.106296266 >>106296288
>>106296257
Can you use custom clipL/G files in reforgeUI? Yes/No?
Anonymous No.106296285 >>106296289
>>106295373
Did it do?
Anonymous No.106296288 >>106296301 >>106297078
>>106296266
no, faggot. quit asking because the answer is no
Anonymous No.106296289 >>106296452
>>106296285
Can you use custom clipL/G files in reforgeUI? Yes/No?
Anonymous No.106296301
>>106296288
Thanks, that's all I wanted to know.
Anonymous No.106296307 >>106296323 >>106296411
>>106296055
what script? ive been using musubi-tuner & include an activation tag at the front of my captions, but none of the loras needed it in the prompt as they tend to get overbaked

accelerate launch src/musubi_tuner/qwen_image_train_network.py \
--dit models/qwen_image_bf16.safetensors \
--vae models/diffusion_pytorch_model.safetensors \
--text_encoder models/qwen_2.5_vl_7b.safetensors \
--dataset_config "${DATASET_CONFIG}" \
--sdpa \
--mixed_precision bf16 \
--timestep_sampling shift \
--weighting_scheme none \
--discrete_flow_shift 3.0 \
--optimizer_type adamw8bit \
--learning_rate 1e-4 \
--gradient_checkpointing \
--max_data_loader_n_workers 2 \
--persistent_data_loader_workers \
--network_module networks.lora_qwen_image \
--network_dim 32 \
--network_alpha 32 \
--max_train_epochs 10 \
--save_every_n_epochs 1 \
--seed 42 \
--output_dir "${OUTPUT_DIR}" \
--output_name "qwen-${DATASET_NAME}" \
--logging_dir "${LOG_DIR}"
Anonymous No.106296311 >>106296339
>>106295497
photoshop- sharpen
Anonymous No.106296323
>>106296307
What are you using to make these mecha?
Anonymous No.106296330
>>106296166
>>106296263
Chroma fails at small high frequency detail in a way not seen since SD1.5. I guess it's because they also trained it at 512 for most of the time. It looks at lot like 1.5 output above its intended output resolution.
It's easy to understand the value people see in Chroma even now but its inability not to melt details is a dealbreaker for most things.
Anonymous No.106296332 >>106296371
16GB
which pick
Anonymous No.106296337
Continuing from last frame is easy for wan, but is it possible to "continue from the last 16 frames"?
So last second is conserved from older video and 4 more seconds could be generated in a more seamless way?
Anonymous No.106296339
>>106296311
Don't do that
Anonymous No.106296343 >>106296359
>>106296166
chroma can't even handle this prompt when it focuses solely on the landscape shot and has the girl removed. I welcome the chroma defender ITT to prove me wrong here and re-engineer this prompt to depict non-garbled buildings and pedestrian crowds.

>>106296263
sure, and also LORA cope can only go so far. Flux users have been coping with LORAs for a year and that's exactly why we've needed finetunes like Krea or chroma. Because you should be able to just prompt styles and mash them together on the fly. It's only due to pure retardation on the part of model trainers that we keep getting models that are so restricted in style. so chroma has a strong advantage there, and I'll keep using it.
Anonymous No.106296349
Do I need anything special to encode images as SD3 latents?
Anonymous No.106296359
>>106296343
This is my biggest issue with Chroma. Everything is kind of there but nothing is. Everything is a suggestion of a thing.
Anonymous No.106296360
Anonymous No.106296370
Where can I find wan2gp example prompts for anime?
Do I need any special models?
Anonymous No.106296371
>>106296332
14B Q8
Anonymous No.106296382
>>106295720
Too high for fat thighs bro
and too low for fat hoes
Anonymous No.106296390
Anonymous No.106296394 >>106296437
la creatura
Anonymous No.106296407 >>106296475 >>106296588
Anyone able to help me understand why my outputs look like this? Using Wan 2.2 I2V w/ Wan 2.1 vae, plus some lora.
Anonymous No.106296411 >>106296490
>>106296307
ostris AI toolkit
it isn't like my artist lora does literally nothing, it just changes it and not towards the artist style much
Anonymous No.106296437
the qwen emotional photography LORA is actually nuts. it seems to have a number of different styles baked in, with zero documentation of this capability of course other than the examples on civit.

>>106296394
you don't get it... this is flawless realism right here. chroma and AI models in general can't possibly improve beyond this point.
Anonymous No.106296445 >>106296459 >>106296502
>>106296044
It looks much better than the Qwen plastic and baked in bokeh that you never asked for.
Are you guys implying Chroma can't do a coherent background though. I mean, of course it can, what you're implying is just silly.
Anonymous No.106296452
>>106296289
Yes moron. Stfu and click on the uptagging model sequencer, menu comes up for uClip options.
Anonymous No.106296459 >>106296495 >>106296525 >>106296534
>>106296445
your selective reading abilities are admirable. I clearly just demonstrated that chroma can't do a coherent background if it has too much complexity. it can do simpler backgrounds just fine.
Anonymous No.106296463
Ahaha, the chroma shill is back.
Anonymous No.106296464
>amateur style photography is better!
>no professional style photography is better!
Anonymous No.106296475 >>106296513
>>106296407
if we had your workflow we could tell you.

4chan strips metadata from files so the .mp4 you uploaded doesn't contain the workflow.
Anonymous No.106296490
>>106296411
if i look at my lower epochs i get that- it changes it but doesn't do much, then cranking up the weight changes it more but usually with artifacts

so i'd try more steps or bump up the learning rate (remove one of the zeros) to get it to overbake

im kind of a retard with lora training though, i like to have it overbaked & then just scale back the weight when using it
Anonymous No.106296495 >>106296504 >>106296525 >>106296534
>>106296459
just lol...
Anonymous No.106296501 >>106296519 >>106296534 >>106296550 >>106296735
>>106296166
My first result is nowhere near as poor as what you're posting. I do think that you're a troll and I'm not sure I can take you seriously
https://files.catbox.moe/ii8qlr.png
Anonymous No.106296502
>>106296445
You gave it a softball and it still blew it on the pebbles and overhanging cables
Anonymous No.106296504
>>106296495
>Posts objectively worse mage
>actually, here's a better image.

Chroma cope is at an all time high.
Anonymous No.106296513 >>106296523
>>106296475
My bad anon, good catch. here's the raw workflow itself I imported from some other thread. Do note the steps are increased in this image from some testing but typically stay around 6 for each sampler.

See anything stupid here? Your help will contribute to more miku.
Anonymous No.106296519 >>106296552
>>106296501
that's terrible and you didn't even choose a background with a lot of people in the distance like OP
Anonymous No.106296523 >>106296580 >>106296599 >>106296752
>>106296513
Miku overtook my file upload, im a moron - here's my workflow
Anonymous No.106296524
this time, qwen q8 (gguf) non distilled, with 8 steps lora, 2.5cfg (default/recommended)

an anime style Miku Hatsune putting a puzzle together on a table, the puzzle is a rectangular puzzle of Miku Hatsune on a wood table. On her arm is the text "01" in red text.

supposedly it's better quality but you can do 1.0 for faster gens
Anonymous No.106296525 >>106296550
>>106296459
>slop but coherent
>>106296495
soul but messy
Anonymous No.106296534
>>106296459
>>106296495
>>106296501
qwen won
Anonymous No.106296550 >>106296561
>>106296501
>have to double the prompt length to compensate for model shortcomings
>still no large crowd in background
>still melted buildings
>background figures are melted, walking into walls, standing ominously, floating dismembered legs...
admittedly you improved it a bit, but you failed overall.

>>106296525
I disagree with calling these qwen outputs slop. Compare them to a base Flux or Pony Realism output, now THAT is slop.
Anonymous No.106296552 >>106296577
>>106296519
Your idea of what this model is capable of comes from flawed assumptions. You have been filtered the same way that Qwen anon has, assuming only Qwen can do blurred (not sharp) neat backgrounds.
Anonymous No.106296561
>>106296550
just because its not as slop as base flux or pony doesnt mean its not slop at all, anon
Anonymous No.106296571
>>106296570
>>106296570
>>106296570
Anonymous No.106296577 >>106296585
>>106296552
Last one, you guys are actual clowns. I mean, if you want to impress me with Qwen, show me something looks sharp, not blurred and bokeh'd, hiding imperfections.
Anonymous No.106296580
>>106296523
might be the wrong lightx2v lora, i think there's a separate one for I2V instead of T2V.
Anonymous No.106296585 >>106296646
>>106296577
I lied. Just got Chroma equivalent of what you posted.
Anonymous No.106296588
>>106296407
call it lo-fi aesthetic and put it on civitai as early access to milk some chuddie money
Anonymous No.106296599
>>106296523
oh wait a sec, the bigger issue is you set steps to 25. it needs to be aligned with those start_at_step, end_at_step variables (double the 3). so it should be set to 6 steps.
Anonymous No.106296600
The last gasps of the chroma user. Spamming blurry maids on blurry nonsense streets as he realizes he is alone.
Anonymous No.106296623
so for qwen, cfg 1 is faster, but 2.5 cfg is recommended, how much is 1 hurting quality in general? seemed ok for the distilled quen models.
Anonymous No.106296626
>>106295370
KEK
Anonymous No.106296646
>>106296585
2nd seed. Oh wow, I'm so impressed, Chroma is SOTA because it can get coherent blurred backgrounds right! Not like this hasn't been the case since base Flux!
Anonymous No.106296686
>>106295805
wow chroma crushed another digital camera photo of an asian girl in overcast lighting, I hope someone is keeping track of this
Anonymous No.106296694
Anonymous No.106296714
Threadly reminder that Chroma is meant to be used with Loras. If you use the vanilla model with no loras, you are not using it with all of its potential. I strongly advise you to train a lora on the kind of images you like if you are going to use Chroma.

Qwen-Image users are already aware of this since all the base model produces is slop.
Anonymous No.106296731 >>106296763
a textbook with Miku Hatsune on the cover with the title "how to generate 1girls!". Miku is sitting at a 1990 style computer with a CRT monitor. On her arm is the text "01" in red text.

kino, it knows how to make old style PCs.
Anonymous No.106296735
>>106296501
kek wtf are those distorted faces
Anonymous No.106296752
>>106296523
Your shift values aren't the same.
Anonymous No.106296763
>>106296731
a computer textbook with Miku Hatsune on the cover with the title "how to generate 1girls!". Miku is sitting at a 1990 style computer with a CRT monitor. On the screen of the monitor is a chibi version of Miku Hatsune, in monochrome. On her arm is the text "01" in red text.
Anonymous No.106296875
>>106296062
nice
Anonymous No.106297078
>>106296288
quit being such a let down bro. He was asking a simple question. How should he have known. There's very limited information about the different tools capabilities and the limitations.
Anonymous No.106297086
The chroma examples vs qwen look obviously visibly worse. Like extremely so. Sure, it doesn't apply aggressive DoF to the background like qwen, but what it does have is a background that looks like my nightmares. I legit cannot understand how this anon is thinking chroma is producing something better.