
Thread 105748241

325 posts 266 images /g/
Anonymous No.105748241 [Report] >>105748286 >>105748802 >>105753166
/ldg/ - Local Diffusion General
And There Will Be More Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105745833

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Models, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Cook
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105748254 [Report] >>105748286
Blessed thread of frenship
Anonymous No.105748269 [Report]
Anonymous No.105748280 [Report] >>105748317
>>105748252
I mean in general when it comes to NSFW loras for any model, there's always typically some that are clearly trained by a dude who only cares about one race of chick, and some trained by a dude who actually made effort to make it versatile in that regard. The most "biased" ones are usually "towards" Asian chicks
Anonymous No.105748286 [Report] >>105748298
>>105748254
still no neighbors list update
rude.
>>>/vp/napt has SOUL.
>>105748241 (OP)
baker chose some really horrible shit for the collage (again) haha
i DO hope its randomized

anyways,
SMELL YA LATER
Anonymous No.105748287 [Report]
the cartoon man on the left is standing apart from the cartoon man on the right. the man on the right is standing and looking to the right. the man on the left has a speech bubble saying "NO WAY, FAG".
Anonymous No.105748295 [Report] >>105748300 >>105748308
these threads would've been much better if 4chan didn't strip metadata from png's
Anonymous No.105748298 [Report] >>105748322
>>105748286
>i DO hope its randomized
they are never randomized otherwise you'd get random shit like screenshots and semi-NSFW pics which could get the thread nuked

someone hand picks each one individually every single time.
Anonymous No.105748300 [Report]
>>105748295
>filter by checkpoint or prompt
kino
Anonymous No.105748304 [Report] >>105748422
Don't give in to temptation, anons!
Anonymous No.105748308 [Report] >>105748994
>>105748295
yeah, why does it even do that
Anonymous No.105748314 [Report] >>105748348 >>105748356 >>105748450
Is ComfyUI-nunchaku worth it? Is fp4 quantization detrimental to the quality of diffusion? I'm on 12gb.
Anonymous No.105748317 [Report]
>>105748280
>>105748252
>most do have roast beef
not really interested in debating this w\
gross LDG posters on fuggin 4chan bwahaha
>>105748259
in a few more months custom vagina loras will let you make as much arbys as you want
but pls, TRY to remember, its fake & gay.
<3
Anonymous No.105748322 [Report] >>105748327
>>105748298
so the baker is a spiteful faggot then
mental illness
Anonymous No.105748324 [Report] >>105748334
>caring about the faggollage
Anonymous No.105748327 [Report] >>105748340 >>105748341
>>105748322
i think that guy is also your "schizo / janny"
what do you think?
Anonymous No.105748334 [Report]
>>105748324
>being a schizoid
Anonymous No.105748340 [Report]
>>105748327
the data supports it kek
Anonymous No.105748341 [Report]
>>105748327
obv
Anonymous No.105748348 [Report]
>>105748314
no
save up for a real gpu.
Anonymous No.105748352 [Report] >>105748380 >>105748400 >>105749147
Anyone know why I have melty faces with Wan 2.1? I'm using the FusionX model, RifleX thing, and a single lora, and faces just kinda melt into goop. Sometimes they're "fine", but other times they completely shit the bed.
Disabling the loras doesn't seem to affect it much.
I know that the FusionX model can have issues with faces, but the rentry had me under the impression that features could change (like the face going off model), not necessarily turning faces into slime.
Anonymous No.105748356 [Report] >>105748416
>>105748314
its worth it bro
t. 3060 enjoyer
i posted speeds in last thread or before last thread

svdquant speeds up generation time:
RTX 3060 12GB @100W:
clip offloaded to cpu, flux in gpu fully:
100%|| 8/8 [00:24<00:00, 3.04s/it]
-total gen time:
after prompt change: 35 seconds
2nd gen: 25 seconds
clip device default, flux auto offload:
100%|| 8/8 [00:25<00:00, 3.15s/it]
-total gen time:
26s in both cases

@170w:
19s - 8 steps
29s - 20 steps
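A rough sketch of how the numbers above relate, assuming the progress-bar s/it figure only covers the sampling loop and everything else (text encoding, VAE decode, offloading) shows up as overhead in the total gen time. The function names are made up for illustration; the figures are the ones quoted in the post.

```python
def sampling_seconds(steps: int, sec_per_it: float) -> float:
    """Time spent in the sampling loop: steps times seconds per iteration."""
    return steps * sec_per_it


def overhead_seconds(total_gen: float, steps: int, sec_per_it: float) -> float:
    """Whatever part of the total gen time is not the sampling loop."""
    return total_gen - sampling_seconds(steps, sec_per_it)


# (steps, s/it, total gen time in seconds) as reported for the 3060 at 100W
runs = {
    "clip on cpu, flux fully on gpu (2nd gen)": (8, 3.04, 25.0),
    "clip default, flux auto offload": (8, 3.15, 26.0),
}

for name, (steps, spi, total) in runs.items():
    loop = sampling_seconds(steps, spi)
    extra = overhead_seconds(total, steps, spi)
    print(f"{name}: {loop:.2f}s sampling, {extra:.2f}s overhead")
```

On these figures both setups spend nearly all of their time in the sampler, so the roughly 10 extra seconds after a prompt change in the first setup would be mostly the text encoder running on cpu.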
Anonymous No.105748380 [Report] >>105748417 >>105748580
>>105748352
Can you post an example?
Anonymous No.105748391 [Report]
that kontext clothing remover is nuts

like I get that you can do better manually but jfc you can feed a folder of photos to this thing and an hour later everyone is nude

it even seems to understand concepts like leaning over, fat vs skinny, front vs back vs side, can do entire crowds of people at once, and knows not to give men tits (but still gives them a big mangina)

world ain’t ready for this shit
Anonymous No.105748400 [Report] >>105748417 >>105748580
>>105748352
neg: changing face, warping face, ugly face, crooked teeth, ugly, asian
even kling will hardfuck a face 1\10 times anon
think of it like baseball, you cant "win" every gen but you try to "win" a lot

disable teacache if you can
Anonymous No.105748410 [Report] >>105748437
remove the characters from the image. change the text from "hotel" to "LDG".
Anonymous No.105748412 [Report] >>105748437 >>105749032 >>105749188 >>105751228
kontext pixel art lora (with perfect grid alignment) first attempt is cooking..
Anonymous No.105748416 [Report] >>105748425
>>105748356
should I use int4 or fp4 models?
Anonymous No.105748417 [Report] >>105748470 >>105748503 >>105751040
>>105748380
Yeah, gimme a moment. Everything is NSFW, so I'll grab some shit off of danbooru, and try to get an example.
I'm thinking it's the FusionX model though, as the normal Wan model keeps things stable. Only problem is base Wan keeps shit really stiff and slow motion (even with the workflow in the rentry).

>>105748400
I'll give those a try. As for TeaCache, I'm not using that. I'm using the LightX2V thing, so Teacache shouldn't be in that.
>think of it like baseball, you cant "win" every gen but you try to "win" a lot
I know it's gacha, but it definitely seems more like something going wrong than faces just getting bad rolls.
Anonymous No.105748422 [Report] >>105748482 >>105748641 >>105749163
>>105748304

Chroma can't do this
Anonymous No.105748425 [Report] >>105748442
>reveal her stomach
okay man.. you win
>>105748416
int4 on pre rtx 5000 cards
Anonymous No.105748437 [Report] >>105748462 >>105748481 >>105748508
>>105748410
replace the tiles in the picture with ice and snow.
>>105748412
looks good
Anonymous No.105748442 [Report] >>105748464 >>105748493
>>105748425
I'm on 5070. I also noticed --fast degrades quality even in fp8 models for some reason. Anyway thanks, I'll give it a shot.
Anonymous No.105748450 [Report] >>105748470
how many bakers do we have? excluding the migguman.
>>105748314
it is absolutely worth it. you can finetune the speed at the cost of some quality with the cache threshold slider. 0.12 makes it go brr, 0.06 seems like a good spot. in general, there is a loss in quality, that's just how it is. the installation can be a huegpain tho, gl. but go for it - I get a 25 step gen in like 8 seconds.
Anonymous No.105748462 [Report]
>>105748437
Not bad actually, at least for quick conceptualizing, you could be fooled into thinking this was an actual magazine screenshot of Earthbound
Anonymous No.105748464 [Report] >>105748775
>>105748442
yw anon, post speeds if u get it to work
i wonder how much worse my 3060 is compared to a 5070
Anonymous No.105748470 [Report] >>105748503
>>105748417
try vanilla wan2.1 and see if it fugs up
are you img2v or text2v
i have NEVER gotten good results with text2video
>>105748450
shes pretty
Anonymous No.105748481 [Report]
>>105748437
the image is in the style of a nintendo 8-bit videogame.

made the sprites less detailed, neat
Anonymous No.105748482 [Report] >>105748507
>>105748422
Can't do what?
Anonymous No.105748493 [Report]
>>105748442
use fp4 model, it's more accurate than int4
Anonymous No.105748503 [Report] >>105748515 >>105748540
>>105748470
>try vanilla wan2.1 and see if it fugs up
See >>105748417
>I'm thinking it's the FusionX model though, as the normal Wan model keeps things stable.

I'm doing i2v, by the way. t2v doesn't interest me at all. I don't see the point in fucking around with prompting the perfect scene/character/etc for the video on top of all the action when I can do that better with whatever image model I want and start with that.
Anonymous No.105748507 [Report] >>105748538 >>105749163
>>105748482
Elizabethan engravings featuring basic 4chan themes
Anonymous No.105748508 [Report]
>>105748437
>looks good
thx. once i'm happy with results i will share. but might take a few days of experimental training because it's so slow.
Anonymous No.105748510 [Report] >>105748521 >>105748531
okay, this is pretty interesting:

replace every character with Miku Hatsune.

neat that it can pick up on each sprite/character, and distinguish them from the buildings.
Anonymous No.105748511 [Report] >>105749695 >>105749699
Anonymous No.105748515 [Report]
>>105748503
>t2v doesn't interest me at all. I don't see the point in fucking around with prompting

nta

a valid point
Anonymous No.105748521 [Report] >>105749037
>>105748510
replace every character with a pixel art version of Miku Hatsune.
Anonymous No.105748529 [Report] >>105751537
>>105747995
>https://files.catbox.moe/t775eh.png
does this work for anyone else?
Anonymous No.105748531 [Report]
>>105748510
FUCKING KEK
Anonymous No.105748538 [Report]
>>105748507
I'm pretty sure that gen was made with Chroma
Anonymous No.105748540 [Report] >>105748599
>>105748503
>t2v doesn't interest me at all.
agreed, i don't really get the point of t2v and never even downloaded or tried the models
Anonymous No.105748555 [Report]
replace the red car in the middle with a teal color car driven by Miku Hatsune.

the model is very good at picking stuff up. could have done the red truck but didnt. although that got a color swap.
Anonymous No.105748580 [Report] >>105748623
>>105748380
>>105748400
Fuck it, now it's "working" with the shit I was trying to use from danbooru as an example.
Here's one of my fucked outputs.
It's NSFW so head's up before you click.
https://files.catbox.moe/ysdpof.webm
Anonymous No.105748591 [Report] >>105748609
change the location to Akihabara, Tokyo in pixel art style. keep all the characters in the same location.

cool.
Anonymous No.105748599 [Report] >>105748627
>>105748540
what really cooks my noodle is SOME of the lora will work with both img2v and t2v simultaneously (despite being trained for specifically one or the other)
if i had a stronger pc build i would try to train as well...
Anonymous No.105748609 [Report] >>105748627 >>105748629
>>105748591
and it gets even better.

change the location to a convenience store in the desert, in pixel art style. outside the store is a sign that says "SNEEDS FEED AND SEED". keep all the characters in the same location.
Anonymous No.105748623 [Report] >>105748667
>>105748580
the warble in the face is what happens on my machine if i turn teacache up too high (usually 1.5\1.75x)
for simpler cartoon subjects i can crank to max and usually have no issues
the pubes were gross kek
Anonymous No.105748627 [Report] >>105748647
>>105748599
>if i had a stronger pc build i would try to train as well...
rent an A40 on runpod! it's $0.40 per hour
i have two 3090s at home but do all my training that way, especially in the summer, since it doesn't heat up your house

>>105748609
nice to see another pixel art enjoyer
Anonymous No.105748629 [Report]
>>105748609
s a v e d
i plan on getting banned from \vr\ with this image kek
the jannies there are such fuggin dicks
Anonymous No.105748638 [Report]
there we go

change the location to a convenience store in the desert, in pixel art style. outside the store is a sign that says "SNEEDS FEED AND SEED". Below that sign is a sign that says "(formerly chucks)". keep all the characters in the same location.
Anonymous No.105748641 [Report] >>105748652 >>105748660 >>105748692 >>105750202 >>105750252
>>105748422
i had to censor her vajayjay but yeah it can lol
Anonymous No.105748647 [Report]
>>105748627
i'll look into it
if i can get around 30-40 videos cut\cropped up properly
i can go public and release a team rocket rainbow hair wanvideo lora :)
Anonymous No.105748652 [Report]
>>105748641
basado
Anonymous No.105748660 [Report] >>105748692
>>105748641
regular Flux even kinda sorta gets closeish for that matter on the exact same prompt
Anonymous No.105748667 [Report]
>>105748623
I'm not using teacache though. I'm using the LightX2v lora/NAG self attention shit (at 0.6 str as mentioned in the Rentry when using the RifleX model).

>the pubes were gross kek
Fair enough, it's just an example of the fucked face. Though to be fair I kinda figure Power would have much more going on down there.
Anonymous No.105748685 [Report]
Change the location to the surface of the moon, in pixel art style. Keep all the characters in the same location. In the background is the Earth in pixel art style.

kontext is really good at detecting elements based on prompts and also transformations/styles, pretty fun
Anonymous No.105748692 [Report] >>105748770
>>105748641
>>105748660
Original pic in question was made with Chroma though. Neither Flux nor Kontext knows the style, and if you're editing with Kontext it won't gen sex toys or panties flash.
Anonymous No.105748746 [Report] >>105748794
The image is projected on a CRT TV screen in a man's bedroom. A man wearing a suit is holding a SNES controller connected to a game console, and looking at the screen.
Anonymous No.105748770 [Report]
>>105748692
chroma so good ("sad but true" from metallica slowly cueing in. fuck I hate metallica). I mean glad we have it.
Anonymous No.105748775 [Report] >>105748787
>>105748464
I think I've got it working. Seems to look ok, and almost twice as fast as the fp8 checkpoint, nice.
https://imgsli.com/MzkzNTYw
Anonymous No.105748787 [Report] >>105748819
>>105748775
speed? can you post the full workflow so i can do a test, i wanna compare int4 vs fp4
Anonymous No.105748794 [Report]
>>105748746
ok horrible
Anonymous No.105748802 [Report]
>>105748241 (OP)
more konoha chads!? spoon feed me
Anonymous No.105748810 [Report] >>105748818 >>105749105
promotions...denied.

A man is folding his arms and looks upset. the background is white.
Anonymous No.105748818 [Report]
>>105748810
my promotion...gone...
Anonymous No.105748819 [Report]
>>105748787
https://files.catbox.moe/bcv0uh.png
32 seconds, about 1.25it/s @150w

lora: https://civitai.com/models/721039/retro-anime-flux-style
Anonymous No.105748991 [Report] >>105749003 >>105749070
the green cartoon frog is sitting in a bean bag chair and watching TV in his bedroom, wearing a red shirt and blue shorts. On the TV is a sunny beach.
Anonymous No.105748994 [Report]
>>105748308
>yeah, why does it even do that
Terrorists could communicate with image metadata.
Anonymous No.105749003 [Report] >>105749023 >>105749070
>>105748991
the green cartoon frog is sitting in a bean bag chair and watching TV in his bedroom, wearing a red shirt and blue shorts. On the TV is a sunny beach. keep the same expression.

there we go, now it kept the img source face.
Anonymous No.105749023 [Report] >>105749042
>>105749003
lawnmower toes
Anonymous No.105749032 [Report] >>105749062
>>105748412
24GB? 768px?
Anonymous No.105749037 [Report]
>>105748521
That's more like it, pretty cool
Anonymous No.105749042 [Report] >>105749049 >>105749070
>>105749023
ok, now it's better.
Anonymous No.105749049 [Report] >>105749070
>>105749042
except his neck was bad

now it's decent.
Anonymous No.105749062 [Report] >>105749326
>>105749032
1024px, training on rented gpu since 24G isn't enough for that
Anonymous No.105749070 [Report] >>105749129
>>105749042
>>105749049
>>105749003
>>105748991
4 pepes and NONE of them are using crt but laggy flatpanels
how is he supposed to play vidya like that???????
Anonymous No.105749075 [Report] >>105749099
Close enough
Anonymous No.105749087 [Report]
What do you mean you deleted my 10 TB folder of nsfw ai gens.
Anonymous No.105749099 [Report] >>105749104
>>105749075
nice. openpose?
Anonymous No.105749104 [Report]
>>105749099
Chroma spamming gens until it hit it right
Anonymous No.105749105 [Report]
>>105748810
>upset
Looks like a movie villain.
Anonymous No.105749129 [Report] >>105749170
>>105749070
there we go
Anonymous No.105749147 [Report] >>105749185 >>105749280 >>105750996
>>105748352
The MPS Reward LoRA merged into FusionX has a known problem with changing facial appearance. You should probably be using Lightx2v instead.
Anonymous No.105749158 [Report]
Anonymous No.105749163 [Report] >>105749263
>>105748422
>>105748507
kek, the first stage is denial...
Anonymous No.105749170 [Report]
>>105749129
fin. <3
Anonymous No.105749183 [Report]
what's the best way to remove mosaic from hgames
Anonymous No.105749185 [Report] >>105749539
>>105749147
if you post pony 1girls the schizoid will be mad at you too :c
Anonymous No.105749188 [Report] >>105749312
>>105748412
Nice, but perfect pixel alignment ? I mean even if you train on high resolution non filter scaled pixel graphics, Flux will interpolate when generating as far as I know ?

I've never come across a pixel lora which didn't need nearest neighbor scaling on the results.
Anonymous No.105749231 [Report] >>105749239
Anonymous No.105749239 [Report] >>105749305 >>105749367
>>105749231
I'm not too fond of the very high contrast in these images, but they have a very non-ai look to them, well done.
Anonymous No.105749246 [Report] >>105749284 >>105749322 >>105749332
Anonymous No.105749263 [Report]
>>105749163
>kek, the first stage is denial...
That's not true!
Anonymous No.105749278 [Report]
>still waiting for the based China man to release radial attention

We WILL escape 5 second hell
Anonymous No.105749280 [Report]
>>105749147
Well heil there, how you doing fräulein ?
Anonymous No.105749284 [Report]
>>105749246
/LDG/ ladies & gentlemen
change the screen to show whatever horrible abomination the rest of the users(shitters) come up with here and its 1:1 with actual reality
Anonymous No.105749286 [Report]
Anonymous No.105749297 [Report]
Anonymous No.105749305 [Report]
>>105749239
ill try some muted ones next
Anonymous No.105749312 [Report] >>105749342 >>105749348 >>105749357
>>105749188
it's cause people don't know how to train them. you have to have your pixels perfectly aligned and all same scaling (common is X4 or X8)
you can't just shove a bunch of randomly scaled pixel art into the training dataset, and you also have to disable bucketing or rescaling during training
yes there will be some minimal noise from the VAE but if you trained it well it means after rescale the result will be almost identical to the gen
Anonymous No.105749322 [Report] >>105749332
>>105749246
Chroma really doesn't want to output glowing brakes...
Anonymous No.105749326 [Report] >>105749413
>>105749062
Where are you training? I have some soon-to-expire Colab credits that I'd happily burn on a few Kontext loras
Anonymous No.105749332 [Report]
>>105749322
>>105749246
Oops didn't mean to quote
Anonymous No.105749342 [Report] >>105749348
>>105749312
picrel after 0.25x and 4x nearest, almost identical to the gen
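A minimal sketch of that 0.25x then 4x nearest round trip, assuming the gen is already aligned to a 4x grid. It works on plain nested lists of RGB tuples so it runs without any image library; the helper names are made up for illustration and this is not what any particular node or tool does internally.

```python
Pixel = tuple[int, int, int]
Image = list[list[Pixel]]


def downscale_nearest(img: Image, factor: int) -> Image:
    """Keep one sample per factor x factor block (the block's center pixel)."""
    off = factor // 2
    return [row[off::factor] for row in img[off::factor]]


def upscale_nearest(img: Image, factor: int) -> Image:
    """Repeat every pixel factor times horizontally and vertically."""
    out: Image = []
    for row in img:
        wide = [px for px in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def snap_to_grid(img: Image, factor: int) -> Image:
    """The round trip from the post: 1/factor down, then factor up, both nearest."""
    return upscale_nearest(downscale_nearest(img, factor), factor)


def mismatch_ratio(a: Image, b: Image) -> float:
    """Fraction of pixels that differ between two same-sized images."""
    total = len(a) * len(a[0])
    diff = sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return diff / total


# A 2x2 "sprite" at logical resolution, shown at 4x like a well-aligned gen.
sprite: Image = [[(255, 0, 0), (0, 255, 0)], [(0, 0, 255), (255, 255, 0)]]
clean = upscale_nearest(sprite, 4)

# Simulate a bit of VAE noise on a single off-center pixel.
noisy = [list(row) for row in clean]
noisy[0][0] = (250, 5, 5)

snapped = snap_to_grid(noisy, 4)
print("changed by snapping:", mismatch_ratio(noisy, snapped))
print("matches the clean grid:", snapped == clean)
```

If the lora was trained on mixed scales, the blocks will not line up with the sampling grid and the mismatch ratio goes up, which is a quick way to check a gen.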
Anonymous No.105749348 [Report]
>>105749312
>>105749342
nice
Anonymous No.105749357 [Report]
>>105749312
Good stuff, sounds like you know what you're doing!
Anonymous No.105749367 [Report] >>105749390
>>105749239
left is
>pos: muted color, pale color, flat color
>neg: saturated, colorful, neon palette
im sure i can find some more tags to push it further
Anonymous No.105749390 [Report] >>105749419
>>105749367
What model is this ? The style reminds me of Shigenori Soejima of Persona fame
Anonymous No.105749413 [Report] >>105749449
>>105749326
on runpod. i can afford it easily, the more difficult part is dataset preparation, because flux controlnets are so dogshit. this is the first test method and i did img2img + depth controlnet with some loras i trained in the past, but if i set the controlnet too low it changes the image too much and too high it fries the image.
chicken and egg problem, i need good examples to train this task, which don't exist
Anonymous No.105749419 [Report]
>>105749390
https://huggingface.co/Laxhar/noobai-XL-1.0
Anonymous No.105749436 [Report]
Chroma's lack of knowledge of even the most basic anime-adjacent characters like vocaloids is driving me fucking crazy. I hope the chinks will try to have their way with it and train it properly. This is suppossed to be Kagamine Rin
Anonymous No.105749449 [Report]
>>105749413
random example from my training dataset which i painstakingly seedhunted. this is really the worst part of it.
Anonymous No.105749511 [Report] >>105749752 >>105749969 >>105749988 >>105751432
Are there any LLMs made specifically for image2prompt? Bonus points for being able to run within comfy.
Anonymous No.105749539 [Report] >>105749647
>>105749185
Funny enough, that one was made with an illustrious realism model that is so shit fucked that it's pretty much pony but with exceptionally worse prompt adherence.
Anonymous No.105749584 [Report] >>105749619 >>105749624 >>105749894
wow truly 10/10 tool
Anonymous No.105749594 [Report]
Anonymous No.105749619 [Report]
>>105749584
>generated caption:
this is an image submitted by a frogposter devoid of any artistic merit and has been created using (now deprecated) Ai image generation technology, it should not be prompted for by anyone, ever again, and may God have mercy on your soul
Anonymous No.105749624 [Report] >>105749649
>>105749584
Why not joycaption?
Anonymous No.105749626 [Report]
I'm tired, boss.
Anonymous No.105749647 [Report]
>>105749539
2009 tumblr called they said "reblogged"
Anonymous No.105749649 [Report]
>>105749624
Oh that worked. Thanks.
Anonymous No.105749672 [Report]
has anyone used musubi-tuner to make a lora for chroma here? if so, do you mind sharing your configuration
Anonymous No.105749693 [Report]
Kek I tried one more i2p tool and it just fucking died. 5 minutes btw.
Anonymous No.105749695 [Report]
>>105748511
Are you using a film lora?
Anonymous No.105749699 [Report]
>>105748511
>britney spears face
Anonymous No.105749702 [Report] >>105749777 >>105749801
Anonymous No.105749743 [Report]
we got anything that uncensor kontext yet?
Anonymous No.105749752 [Report]
>>105749511
you want an LLM with "vision" capabilities. you can go the ollama route or you can use joy caption, both have comfyui nodes. I had decent success with minicpm-v (5.5gig) but it's very basic, doesn't know a thing about artists. it's ok tho for upscaling when you can't be bothered to write a prompt. joycaption is a whole lot better esp. for niche/nsfw content, 15ish gb tho.
Anonymous No.105749756 [Report] >>105749796 >>105749883
and high contrast in the negs but still not as faded as i want
Anonymous No.105749777 [Report] >>105749855
>>105749702
how 9secs? do you have image prompt?
Anonymous No.105749796 [Report] >>105751201
>>105749756
images from scratch? looks cool
Anonymous No.105749801 [Report]
>>105749702
imagine the smell
Anonymous No.105749809 [Report] >>105749816 >>105749826 >>105749865 >>105750202
hint
Anonymous No.105749816 [Report] >>105749822
>>105749809
censored to hell
Anonymous No.105749821 [Report]
So what's the best shit for generating high fantasy portraits and landscapes? It used to be Fooocus. Is it still?
Anonymous No.105749822 [Report] >>105749828 >>105749850
>>105749816
if you find an uncensored vision-capable llm that isnt called joy caption let me know lol
Anonymous No.105749826 [Report] >>105749914
>>105749809
If I wanted to incorporate this into existing workflow, is there a way to unload the LLM after it is done to not hog VRAM for the image model?
Anonymous No.105749828 [Report]
>>105749822
any gemma 3 abliteration
the problem is that abliteration makes the models retarded
Anonymous No.105749850 [Report] >>105749914 >>105749931
>>105749822
Try the WD14 image tagger. Ollama sucks ass.
Anonymous No.105749855 [Report] >>105749956
>>105749777
https://files.catbox.moe/bxjvug.png
to prompt for longer vids just increase the generated frames. the looping problem doesnt affect most actions.
Anonymous No.105749865 [Report] >>105750476
>>105749809
>artist
dont upset him now he finally left ;3
Anonymous No.105749867 [Report]
Anonymous No.105749878 [Report]
Hmm, close enough.
Anonymous No.105749883 [Report] >>105751201
>>105749756
Lowering the contrast seems to have removed the 'fringing' in the outlines, I prefer this but it's all subjective.
Anonymous No.105749894 [Report] >>105749919
Hey, got a question. I'm using Forge and diving into ControlNet.
The preprocessors are good to go, but I still need to hunt down the models.
I've got some NoobAI models, but I'm after the SDXL ones. I saw a CIVITAI link with 50 ControlNet models featuring the same ballet ballerina image, but downloading all 50 nameless models feels off.
Is there a cleaner source for these?

Another thing, do any depth v2, gold depth, or similar models work in Forge?
What's your take on Forge?
If I want to tackle more 'complicated' projects, is the UI solid?

>>105749584
don't waste time with those models, use gemini or chatgpt4o or claude, they have better image vision than those 8b llama finetunes
Anonymous No.105749914 [Report] >>105749945
>>105749826
comfyui ollama, no, because you load the model with ollama via cmd (but maybe there is a way..?), joy caption comfyui, yes. got an 'unload model after you are done' option.
>>105749850
yeah I hate ollama too. right I have WD14 on my other comfy install. no idea what it does with non anime stuff tho
Anonymous No.105749919 [Report] >>105750097
>>105749894
>https://huggingface.co/xinsir/controlnet-union-sdxl-1.0/tree/main
Download the promax union model so you don't have to download a bunch of controlnet models individually.
Anonymous No.105749931 [Report] >>105749945
>>105749850
It works ok. It can't recognize styles. This pic is a direct feed of wd14 into an image generator. I recommend taking the wd14 output and pruning it, then adding your own style prompts.

https://files.catbox.moe/815btc.jpg
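A small sketch of the prune-then-add-style step, assuming the tagger output is a single comma-separated string. The function name, the drop list and the sample tags are placeholders for illustration, not anything the WD14 node itself provides.

```python
def build_prompt(tagger_output: str, drop: set[str], style_tags: list[str]) -> str:
    """Remove unwanted and duplicate tags, then put your own style tags first."""
    seen: set[str] = set()
    kept: list[str] = []
    for raw in tagger_output.split(","):
        tag = raw.strip()
        if not tag or tag in drop or tag in seen:
            continue
        seen.add(tag)
        kept.append(tag)
    return ", ".join(style_tags + kept)


# Hypothetical tagger output and drop list.
raw_tags = "outdoors, tree, watermark, tree, signature, river"
unwanted = {"watermark", "signature"}
style = ["oil painting", "muted color"]

print(build_prompt(raw_tags, unwanted, style))
```

Putting the style tags first is just one choice; whether order matters depends on the model and text encoder you feed the result into.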
Anonymous No.105749945 [Report]
I tried asking ollama about freemasonry and it completely hallucinated the founder with a made up name. It also said Albert Pike had nothing to do with Freemasonry, then when pressed explained how involved he was.
>>105749931
for >>105749914
Anonymous No.105749956 [Report]
>>105749855
thx
Anonymous No.105749969 [Report] >>105750125
>>105749511
The best one is not local, it's Gemini (uncensored and free from API)

You can easily hook it up using ComfyUI, just modify the code of any node that connects to an API so it connects to Gemini instead (and use the appropriate token)
Anonymous No.105749983 [Report]
Anonymous No.105749988 [Report]
>>105749511
https://github.com/pythongosssss/ComfyUI-WD14-Tagger
Anonymous No.105750008 [Report]
Anonymous No.105750040 [Report] >>105750051
Florence2 > WD14
Anonymous No.105750051 [Report]
>>105750040
for booru tags?
Anonymous No.105750097 [Report]
>>105749919
Thank you! I saw this before but thought it was for ComfyUI, not for Forge.
I’ll give it a shot. Do I need two Python dependencies? One for Forge and another for this? Thanks again!
Anonymous No.105750117 [Report] >>105750566
>>105742711
>>105742655
>it works
Hell yeah time to make some abominations
Anonymous No.105750125 [Report] >>105750202 >>105750221 >>105750252
>>105749969
ok I just made a key and signed up and shit and wow. is there a weekly/daily/monthly token limit?
Anonymous No.105750126 [Report] >>105750253
chroma, sdxl upscale
Anonymous No.105750202 [Report] >>105750237 >>105750289
>>105750125
>>105749809
funny to see how gemini outputs this overly-analytical wall of text, in comparison to how simple the prompt was:
>A black and white engraving print by English satirical engraver and cartoonist William Hogarth in the year 1600.
>A 35 year old prostitute woman with large breasts is sitting on stairs in an alley looking at viewer naughtily wearing a dress with short skirt. She's lifting own skirt to show her white silk panties. Fleshlights, sex toys, and dildos litter the stairs around her.
>The background is Gooner Lane in London, an alleyway notorious for prostitution and alcoholism. A ruined slum is visible in the distance. Fine text at top says "Gooner Lane".

>>105748641
what style prompt did you use?
I find I get best results by researching specific artists on wikipedia and then following this formula:
>media used
>school of art/artistic movement
>artist name
>era
>qualities, style details, etc
I've started learning more art history just to try and find good styles that chroma recognizes.
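The formula above can be sketched as a tiny template that fills the slots in that order. The class and field names are made up for illustration, and the sample values are placeholders loosely based on the engraving prompt quoted earlier in this post; whether a model actually recognizes a given artist or movement still has to be tested.

```python
from dataclasses import dataclass, field


@dataclass
class StylePrompt:
    media: str                      # e.g. engraving, watercolor, woodblock print
    movement: str                   # school of art / artistic movement
    artist: str
    era: str
    qualities: list[str] = field(default_factory=list)

    def render(self) -> str:
        """Join the non-empty slots in the order given by the formula."""
        artist_part = f"by {self.artist}" if self.artist else ""
        parts = [self.media, self.movement, artist_part, self.era, *self.qualities]
        return ", ".join(p for p in parts if p)


style = StylePrompt(
    media="black and white engraving print",
    movement="English satirical engraving",
    artist="William Hogarth",
    era="18th century",
    qualities=["fine crosshatching", "crowded street scene"],
)
print(style.render())
```

The rendered line would go in front of the subject description, so swapping styles only means changing one object.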
Anonymous No.105750217 [Report] >>105750223 >>105750224 >>105750239 >>105750286 >>105750392 >>105751105
how do you pronounce "gguf" ?
I call it double G oof
Anonymous No.105750221 [Report]
>>105750125
>is there a weekly/daily/monthly token limit
You can see the limits per day and rate when you hover over the models on AI studio. It is mostly free.
Anonymous No.105750223 [Report]
>>105750217
goof
Anonymous No.105750224 [Report]
>>105750217
gee-goof
Anonymous No.105750237 [Report]
>>105750202
Yes, well one could prompt Gemini to condense the info into a paragraph and it nails it too.
Anonymous No.105750239 [Report]
>>105750217
gg you fuck
Anonymous No.105750252 [Report] >>105750276 >>105750289
>>105748641
lmao
>>105750125
I wonder how that prompt style works when fed into flux. SD images come out basically the same with WD14 prompter, because it overloads it so much, it just werks.
Anonymous No.105750253 [Report]
>>105750126
French woman? Looks like Eiffel tower in the background
Anonymous No.105750276 [Report] >>105750293
>>105750252
Does flux use asterisks in its syntax? It should be fine probably.
Anonymous No.105750286 [Report]
>>105750217
ge-gu-oof
Anonymous No.105750289 [Report] >>105750401
>>105750202
here, just a little vision/prompt enhance thing. I'm impressed. also, that gen is super cool. is that william hogarth again?
>>105750252
even flux has its limits lol but here, condensed. flux and chroma can work with that np. I mean you can feed t5xxl novels but that's pointless.
Anonymous No.105750293 [Report]
>>105750276
IDK I just started messing with it because someone made a styx lora, and it's only available on flux.
Anonymous No.105750392 [Report]
>>105750217
giguff
Anonymous No.105750401 [Report] >>105750557 >>105750773
>>105750289
>use british english.
Anonymous No.105750423 [Report] >>105750445 >>105750461 >>105750465 >>105750566
reminder you can use a quick image stitch to get two images to interact without two image sources in a workflow:

man on left (does action) with man on right, etc. anime girl, object, whatever, just identify each and it works.
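
For anyone who wants to do the stitch outside ComfyUI, here is a minimal sketch of the same idea: scale both images to a common height and paste them side by side, then feed the result in as the single image source. The function names and the white background are assumptions for illustration, and this is not what the stitch node does internally. Pillow is only needed for the second function.

```python
def side_by_side_layout(left_size, right_size):
    """Compute the canvas size and scaled sizes for a horizontal stitch.

    Both images are scaled to the taller image's height, keeping aspect ratio.
    Returns (canvas_size, left_scaled_size, right_scaled_size, right_x_offset).
    """
    lw, lh = left_size
    rw, rh = right_size
    height = max(lh, rh)
    new_lw = round(lw * height / lh)
    new_rw = round(rw * height / rh)
    return (new_lw + new_rw, height), (new_lw, height), (new_rw, height), new_lw


def stitch(left_path, right_path, out_path):
    """Stitch two image files side by side and save the result."""
    # Pillow is imported lazily so the layout helper works without it.
    from PIL import Image

    left = Image.open(left_path).convert("RGB")
    right = Image.open(right_path).convert("RGB")
    canvas_size, lsize, rsize, x_off = side_by_side_layout(left.size, right.size)

    canvas = Image.new("RGB", canvas_size, "white")
    canvas.paste(left.resize(lsize), (0, 0))       # left subject
    canvas.paste(right.resize(rsize), (x_off, 0))  # right subject
    canvas.save(out_path)
    return canvas_size
```

Then prompt by position ("the man on the left ... the woman on the right") as described above.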
Anonymous No.105750426 [Report]
Anyone have a workflow for chroma detail calibrated? It's kind of producing results much worse than when I tried v30.
Anonymous No.105750436 [Report]
Flux Kontext is perfect at removing black bar censorship in hentai. But it completely screws up against pixelation - too bad.
Anonymous No.105750445 [Report] >>105750487 >>105750530 >>105750579
>>105750423
Based. You can use a node like this one so you can do it right on your workflow.
Anonymous No.105750461 [Report]
>>105750423
forsen
Anonymous No.105750465 [Report] >>105750492 >>105750566
>>105750423
but what is the advantage? quicker processing?
Anonymous No.105750476 [Report]
>>105749865
REEEEEEEEEEE!!!!!
Anonymous No.105750487 [Report]
>>105750445
ah nice, also you can use queue selected nodes to get that to update without doing the whole workflow.
Anonymous No.105750492 [Report]
>>105750465
two image source workflow is 2x speed I think, this is default speed, either is fine but I just wanted to see if it works, it does
Anonymous No.105750518 [Report]
kek, it works well desu

the man on the left is holding a tall body pillow with an image of the woman on the right.

just took my image concatenate output and tossed it in my img input. this is nice cause I dont even need photoshop to stitch it fast, this is faster.

used image stitch output to show you input vs output:
Anonymous No.105750530 [Report] >>105750663
>>105750445
cool
Anonymous No.105750557 [Report]
>>105750401
>British "people"
Anonymous No.105750558 [Report]
the man on the left is holding a tall white body pillow with an image of the woman on the right on it.

same process
Anonymous No.105750566 [Report] >>105750579
>>105750423
post links btw for the flux image stitching comfyui workflow
>>105750117


>>105750465
Flexibility in what you can do with AI. It might be easier to gen 2 images then try stitching them together than try one regional prompt.
Anonymous No.105750570 [Report] >>105750717
mind you, you can get any objects to interact this way, it doesnt have to be waifus. you could put an image on a vase or painting for example. but if kontext doesnt know a character, you can use this as a workaround for no lora, if there isnt one. want (anime character), just use them as a source.

same process, but with cinderella (nikke)
Anonymous No.105750579 [Report]
>>105750566
it's the default kontext workflow plus >>105750445

for quick stitching

https://docs.comfy.org/tutorials/flux/flux-1-kontext-dev#flux-1-kontext-dev-basic-workflow
Anonymous No.105750652 [Report] >>105750661
the anime girl on the right is holding a portrait of the cartoon frog on the left. keep their expressions the same.

even with a cropped image of the girl it did well:
Anonymous No.105750661 [Report]
>>105750652
and the default output:
Anonymous No.105750663 [Report] >>105750724
>>105750530
add rainbow hair
add R on clothes
become slutty
you are rocketnow
Anonymous No.105750678 [Report] >>105750799
Is there a sampler/scheduler combo for the cfg1 Chroma that helps with the baked images or do I just have to deal with it?
Anonymous No.105750703 [Report]
Anonymous No.105750716 [Report]
the cartoon frog on the left is holding a tall white body pillow with an image of the anime girl on the right. keep their expressions the same.
Anonymous No.105750717 [Report] >>105750739
>>105750570
Can it mimic art styles? Like, "draw the character on the left in the art style of the right"
Anonymous No.105750724 [Report]
>>105750663
>become slutty
hot
Anonymous No.105750739 [Report]
>>105750717
I think it's primarily for interactions but not sure will have to test more
Anonymous No.105750748 [Report]
Anonymous No.105750764 [Report] >>105750773
someone try if it can copy tattoos from one body to another
Anonymous No.105750773 [Report]
>>105750401
Last british gen

>>105750764
Inpainting is probably still better. Completely guessing.
Anonymous No.105750774 [Report] >>105750821
the anime girl on the left is holding a magazine with an image of the anime girl on the right on the cover. The title of the magazine is "LDG".
Anonymous No.105750775 [Report] >>105750790 >>105750802
>drag official comfyui vace v2v mp4 to the webui
>doesn't load the workflow
>already the latest comfyui version
help
Anonymous No.105750790 [Report]
>>105750775
works on my machine. redownload file, restart comfyui, reboot pc.
https://docs.comfy.org/tutorials/video/wan/vace#1-workflow-download-2
Anonymous No.105750799 [Report] >>105750880
>>105750678
how many have you tried? was gonna do the deed and do some grids but we're in the middle of a heatwave. there is a rescale cfg node in the comfy core, maybe try that?
Anonymous No.105750802 [Report] >>105750815
>>105750775
works on my machine. redownload file, restart comfyui, reboot pc.
https://docs.comfy.org/tutorials/video/wan/vace#vace-video-to-video-workflow
Anonymous No.105750815 [Report]
>>105750802
I did all that
Anonymous No.105750821 [Report] >>105750912
>>105750774
make one of Asuka reading a book saying "#1 Waifu /a/ward" with pic of Rei
Anonymous No.105750880 [Report] >>105750938
>>105750799
I'm just randomly trying shit. But it also gets mixed with the chroma artifacts themselves so no clue.
>cfg rescale
Should I even use that if the cfg is 1? I'll try that in the standard chroma tho.
Anonymous No.105750906 [Report] >>105750938 >>105750943 >>105751009 >>105751017
on average how much time do anon spend on inpainting?
just curious
Anonymous No.105750912 [Report] >>105750925
>>105750821
its a bit tricky but simple enough to make asuka with a blank book without a stitch prompt:
Anonymous No.105750925 [Report]
>>105750912
then a simple shoop does it:
Anonymous No.105750938 [Report] >>105750962 >>105750963 >>105750971
>>105750906
days, weeks of my life gone. sometimes more than an hour per gen. dozens of gens per set. it's fun! ..
>>105750880
was just an idea, would be nice if it works. I only ran chroma cfg1 once last night and some gens were close to being baked, yeah. euler seemed the least problematic
Anonymous No.105750943 [Report]
>>105750906
I don't inpaint lmao. I press generate and get 1500x2000 big booba images
Anonymous No.105750948 [Report] >>105750956
and kontext makes it easy to make this stuff:

anime girl is holding a blank white painting with a black frame.
Anonymous No.105750952 [Report]
Anonymous No.105750956 [Report]
>>105750948
or

anime girl is holding a blank white painting with a black frame. keep her expression the same.
Anonymous No.105750957 [Report] >>105751142
one more with this img:

the anime girl is wearing a black business suit. keep her expression the same.
Anonymous No.105750962 [Report] >>105750971 >>105750994
>>105750938
I can't imagine that. My 4090 can do kontext gens in 29 seconds. Regular forge gens in less than 10. And the comfyui 5 second video gens in only like 3 mins
Anonymous No.105750963 [Report] >>105750971
>>105750938
>sometimes more than an hour per gen
Nigga what are you doing?
Anonymous No.105750971 [Report]
>>105750962
Fucker. Us Linux + AMD users are the most oppressed. I'd generate an image but I'm messing with i2v again.
>>105750938
>>105750963
He's got to be overutilizing vram. Going over 90-95% turns GPU based processing into hanging on CPU processing.
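
A quick way to check whether you are near that range, as a rough sketch: it assumes a PyTorch build with CUDA or ROCm support, and the 90% threshold is just the figure from this post, not a documented limit. `torch.cuda.mem_get_info()` returns free and total bytes for the current device; the helper names are made up for the example.

```python
def vram_used_fraction(free_bytes, total_bytes):
    """Fraction of VRAM currently in use, given free and total bytes."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    return 1.0 - free_bytes / total_bytes


def report_vram(threshold=0.90):
    """Print current VRAM usage; return True if it is at or above threshold."""
    # torch is imported lazily so the helper above works without it.
    import torch

    free_bytes, total_bytes = torch.cuda.mem_get_info()
    used = vram_used_fraction(free_bytes, total_bytes)
    print(f"VRAM in use: {used:.0%}")
    return used >= threshold
```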
Anonymous No.105750974 [Report]
is there a big quality difference between vace 1.3b and 14b?
Anonymous No.105750979 [Report] >>105750986 >>105750992 >>105751010
the anime girl on the left with blue hair is waving hello to the cartoon frog on the right. they are standing on a sunny beach. the cartoon frog is wearing a red shirt and blue shorts. keep their expressions the same.

left image is the stitch/concatenate source from adding 2 images.
Anonymous No.105750984 [Report]
Hmm, this frame seems better.
Anonymous No.105750986 [Report]
>>105750979
excuse the potato quality
Anonymous No.105750992 [Report] >>105750998
>>105750979
I can't wait for the day a kontext-like model/workflow thats not lobotomized and censored is released.
Anonymous No.105750994 [Report] >>105751032
>>105750962
just checked, one inpaint takes 4 secs, hyper lora-ed sdxl on a 3090. I just want things to be perfect and I don't always find the exit. I know, its silly, but w/e. usually tho, a few mins per gen including krita stuff. bla
Anonymous No.105750996 [Report] >>105751031
>>105749147
>You should probably be using Lightx2v instead.
I am. The rentry says FusionX should be a "drop-in replacement" for the default Wan Model when using the Lightx2v workflow, and mentions that Lightx2v should be dropped to 0.6 strength.
I have it set up like so.
FusionX gguf> Lightx2v (@0.6str)> General loras> PatchModelOrder> TorchCompileModelWanVideo> WanVideoNAG > Apply RifleXRoPE WanVideo

Here's a screenshot of the main bit of the workflow. Maybe someone can glean something from it. Not pictured are the video combine nodes.
Anonymous No.105750998 [Report]
>>105750992
the clothes remover lora is surprisingly effective and thats a day 1 lora, we can fix all the censorship bs, the model itself knows how to pose characters/people or do diff stuff
Anonymous No.105751003 [Report]
the cartoon man with a white face on the left is standing beside the cartoon frog on the right. they are standing on a sunny beach. the cartoon frog is wearing a red shirt and blue shorts. keep their expressions the same.

kek, if I didnt say white face it made it a generic guy
Anonymous No.105751009 [Report] >>105751016
>>105750906
i don't really inpaint i just slap the text and logos i want on there with krita then run that bitch back thru noob-inpaint to blend it
i spend more time erasing fucking extra fingers than i do anything else probably
nothing more infuriating than getting exactly what you want but a hand is a little messed up or some shit
Anonymous No.105751010 [Report]
>>105750979
This is really amazing. Flux is pretty shitty at doing raw gens, but feed it stuff and it can work with it very well. Recognizing the style, positioning, and filling in the details. Great work!
Anonymous No.105751012 [Report]
>>105745996
noice
Anonymous No.105751016 [Report] >>105751072
>>105751009
ai still struggles with hands in 2025?
Anonymous No.105751017 [Report]
>>105750906
I haven't done manual inpainting in ages.
Anonymous No.105751020 [Report]
Anonymous No.105751027 [Report] >>105751034
the cartoon man with a white face on the left is swimming in the ocean near the cartoon frog on the right who is on a fishing boat. the cartoon frog is wearing a red shirt and blue shorts. a red cooler with beers is at the front of the boat. keep their expressions the same.

you could theoretically generate 1 billion pepes a day with this model with random text.
Anonymous No.105751031 [Report] >>105751040
>>105750996
it's the fusionX model, nigga
use the regular Q6 wan2.1 instead. Then manually add whatever loras fusionX has in its merge. Remove one by one
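
The "remove one by one" part is a leave-one-out test. A small sketch of how to enumerate the runs so none get skipped; the LoRA names below are just ones mentioned in this thread used as placeholders, not the full FusionX recipe, and the function name is made up.

```python
def leave_one_out(loras):
    """Return (removed_name, remaining_stack) pairs, one per LoRA.

    Order of the remaining stack is preserved, since load order can matter.
    """
    return [(name, loras[:i] + loras[i + 1:]) for i, name in enumerate(loras)]


# Placeholder stack; swap in whatever the merge actually contains.
stack = ["CausVid", "AccVid", "MPS Reward"]
for removed, remaining in leave_one_out(stack):
    print(f"without {removed}: {remaining}")
```

Gen the same seed and prompt for each stack; whichever run fixes the problem points at the removed LoRA.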
Anonymous No.105751032 [Report] >>105751069
>>105750994
I'm curious how much faster a 5090 is over a 4090. Anyone out there do a comparison? I always get demoralized when searching AI advice or tutorials because 100% of them are made by in comprehension me Indians. The stereotype is so true lmao.
Anonymous No.105751033 [Report]
the cartoon man with a white face on the left is wearing plate armor and holding a sword and shield, near the cartoon frog on the right who is holding a spear and a shield. the cartoon frog is wearing a red shirt and blue shorts. they are standing in a grass field in the medieval era. keep their expressions the same.
Anonymous No.105751034 [Report] >>105751039
>>105751027
You can use comfyui dynamic prompts.
https://github.com/adieyal/comfyui-dynamicprompts

You can set them to increment through each section of your dynamic prompts to make repeatable image sets. You run into the same problem as early attempts at using text to image for sequences: you lose coherency through frames.

(warning porn)
https://files.catbox.moe/hocp44.jpg
Anonymous No.105751039 [Report] >>105751069
>>105751034
Wtf I just got promoted
Anonymous No.105751040 [Report] >>105751265
>>105751031
Yeah I know that.
>>105748417
>I'm thinking it's the FusionX model though, as the normal Wan model keeps things stable. Only problem is base Wan keeps shit really stiff and slow motion (even with the workflow in the rentry).
Regular Wan outputs slow motion shit even with RifleX.
>use the regular Q6 wan2.1 instead. Then manually add whatever loras fusionX has in its merge. Remove one by one
That's what I'm doing now.
Anonymous No.105751041 [Report]
>>105747477
naisu
Anonymous No.105751045 [Report] >>105751055
the cartoon man with a white face on the left is wearing plate armor and is kneeling on the floor, near the cartoon frog on the right who is sitting on a throne and wearing a gold crown and red robe. they are standing in a throne room in a medieval castle. keep their expressions the same.

not even the flux pepe lora was this effective.
Anonymous No.105751054 [Report]
>>105747883
double nice
Anonymous No.105751055 [Report]
>>105751045
and it's just using image stitch node as a source (generate the 3 nodes then drop it in your image source)
Anonymous No.105751063 [Report] >>105751066 >>105751069 >>105751156
Anonymous No.105751066 [Report]
>>105751063
Little bit too banana shaped for me
Anonymous No.105751069 [Report] >>105751117
>>105751032
techpowerup claims about a 30% performance increase.
FP64 1200 GFLOPS vs 1600 GFLOPS
https://www.techpowerup.com/gpu-specs/geforce-rtx-4090.c3889
https://www.techpowerup.com/gpu-specs/geforce-rtx-5090.c4216

>>105751039
I specifically used the combinatorial prompts node, control after generate: increment, autorefresh: yes, and structured it like this. It tells the node to go through them 1 at a time, adding each of the tag sets to your base prompt. The only problem is you need to close the tab or something, because cancelling leaves it at the last seed increment.
@{tags1,|
tags2,|
tags3,|
tags4,|}
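
What that setup does, roughly, sketched in plain Python: step through the tag sets in order and append one per queued gen, wrapping around at the end. This is only an illustration of the behaviour described above, not the dynamicprompts implementation, and the function name and arguments are made up.

```python
from itertools import cycle, islice


def cycle_prompts(base_prompt, tag_sets, count):
    """Build `count` prompts, appending one tag set at a time, in order.

    Wraps back to the first tag set after the last one. Empty tag sets
    are skipped (an assumption made for this sketch).
    """
    options = [tags.strip() for tags in tag_sets if tags.strip()]
    return [f"{base_prompt} {tags}" for tags in islice(cycle(options), count)]


for prompt in cycle_prompts("1girl,", ["tags1,", "tags2,", "tags3,"], 4):
    print(prompt)
```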
>>105751063
OUHHHH SAG EROTIC
Anonymous No.105751072 [Report]
>>105751016
it still makes mistakes at odd angles
an open palm or fist is usually fine
a "cute" hand pose with the fingers splayed is also good, but when the fingers start to occlude each other like picrel it tends to forget to draw the visible knuckles for the occluded fingers and starts blending them together instead. stuff like holding a cigarette or a pencil still seems to give it trouble.
i usually throw "4 fingers 1 thumb" in the prompt but it's not entirely foolproof.
Anonymous No.105751085 [Report] >>105751108
one more, with a diff image + stitch

the anime girl on the left wearing a black swimsuit and black blindfold is standing beside the cartoon frog on the right. they are standing on a sunny beach. the cartoon frog is wearing a red shirt and blue shorts. keep their expressions the same.

cute!
Anonymous No.105751103 [Report] >>105751116
Can someone with kontext try getting the jewelry/watch/grill from this image onto a new character, I've had really good results with kontext otherwise.
Anonymous No.105751105 [Report]
>>105750217
guh-guff
Anonymous No.105751108 [Report] >>105751142
>>105751085
added tall girl and short frog to change the proportions, works:
Anonymous No.105751116 [Report] >>105751125
>>105751103
I'd try inpainting. That's too low quality. You're better off finding hires grill/watch, editing onto a character, then inpaint.
Anonymous No.105751117 [Report] >>105751215
>>105751069
Thats just numbers on a chart by brown people "writing" tech articles though. I mean real world timer on gen time in forge and comfy etc between 4090 and 5090
Anonymous No.105751125 [Report]
>>105751116
yeah I figured, was just trying to see how far kontext could really go
Anonymous No.105751137 [Report] >>105751141
replace the newspaper the green cartoon frog is reading with a white book. The book has the text "LDG lewd outputs" with a picture of a blonde anime girl below it, on the cover.
Anonymous No.105751141 [Report]
>>105751137
Stop reading Cunny Limited in public that's not allowed
Anonymous No.105751142 [Report] >>105751145
is there a way to do this kontext imageconcat to replace one character with another from a different image? for example: replace 2b here >>105751108 with asuka from >>105750957
Anonymous No.105751145 [Report]
>>105751142
stitch image node, then say "replace the white hair anime girl on the left with the red hair anime girl on the right"

might work
Anonymous No.105751156 [Report] >>105751161
>>105751063
I love it how it sometimes showcases spatial awareness as good as Wan's despite it being an image-only model
Anonymous No.105751161 [Report]
>>105751156
I wish it didn't make them Cross-eyed so often
Anonymous No.105751169 [Report] >>105751173
remove the black hat of the white cartoon man on the left and replace it with a sombrero.
Anonymous No.105751173 [Report] >>105751179 >>105751196
>>105751169
remove the black hat of the white cartoon man on the left and replace it with a sombrero. give the white cartoon man a curly moustache. change the coffee cup to a beer bottle.

what a neat model.
Anonymous No.105751179 [Report]
>>105751173
also, added "keep the expression the same."

retains the face.
Anonymous No.105751196 [Report] >>105751215
>>105751173
replace the green cartoon frog with a slim anime version of Miku Hatsune.

unlimited potential with this model for edits.
Anonymous No.105751201 [Report]
>>105749796
scratch?
>>105749883
unsure what you mean by fringing but i agree low contrast fits the style more
Anonymous No.105751215 [Report]
>>105751117
It's a real performance metric though. Anything else and you need someone who owns both to test.
>>105751196
KEK. Nice gens. It's pretty good at keeping the style.
Anonymous No.105751222 [Report] >>105751232 >>105751263
change the text from "Exodia The Forbidden One" to "Miku Hatsune The Chosen One". Replace the image in the center with an image of Miku Hatsune who is smiling.
Anonymous No.105751228 [Report]
>>105748412
forsen
Anonymous No.105751232 [Report] >>105751244
>>105751222
also kontext is an amazing tool for duping text for fonts that seemingly dont exist online, or are impossible to figure out.
Anonymous No.105751244 [Report] >>105751256
>>105751232
change the text from "NIKKE" to "LDG". Change the text from "goddess of victory" to "image gen general".
Anonymous No.105751256 [Report] >>105751257 >>105751269
>>105751244
change the anime girl's hair color to blonde. remove her hat.
Anonymous No.105751257 [Report]
>>105751256
Its so over.
Anonymous No.105751263 [Report]
>>105751222
This is crazy
Anonymous No.105751265 [Report] >>105751395
>>105751040
Alright, so it seems like it's something fucked with the workflow in the rentry... and/or my tweaks to it. The Workflow included with the FusionX "lora" (literally just each merged lora loaded one by one), doesn't have the face issue.
Anonymous No.105751269 [Report] >>105751273 >>105751275
>>105751256
change the anime girl's hair color to red. remove her hat. she is facing the camera directly.

so many edit possibilities and things you can do.
Anonymous No.105751273 [Report] >>105751332
>>105751269
>so many edit possibilities and things you can do
I still can't get it to make goatse wear pants. Is it just a prompt skill issue or does it have no idea what's going on?
Anonymous No.105751275 [Report] >>105751276
>>105751269
That's a completely different anime girl though.
Anonymous No.105751276 [Report] >>105751287
>>105751275
I didnt include "keep the same expression" which is important, like keeping all the pepe faces the same. there is a lot of flexibility.
Anonymous No.105751287 [Report] >>105751334
>>105751276
Nah, I mean the hairstyle and length is completely different, her outfit is only vaguely similar, and her eye color is different as well.
Anonymous No.105751312 [Report]
Anonymous No.105751332 [Report] >>105751398
>>105751273
I'd manually paint over his asshole, or photoshop him holding an orange. Great test lol.
Anonymous No.105751334 [Report]
>>105751287
the model does much better with a full body reference, otherwise it has to guess what their figure is like or whatever.

ie: the anime girl is sitting in a beach chair reading a book. she is wearing the same swimsuit. keep her expression the same.
Anonymous No.105751344 [Report] >>105751351
"Remove the text"
not bad
Anonymous No.105751351 [Report] >>105751359
>>105751344
KEK. Also it did very good. Shit is going to get scary when this software proliferates. Everything online is going to be fake.
Anonymous No.105751359 [Report] >>105751389 >>105751396 >>105751397 >>105751458
>>105751351
Normalfags still don't know about this stuff and Indians are still stuck using very limited free models because they're poor and brown. Its a golden age of autists with expensive gear doing whatever they want like how the whole internet was until the late 2000s. It won't last. First time some boomer spams a bunch of AI CP everywhere the free wheeling Is over.
Anonymous No.105751368 [Report]
Lands of Lore 1 lora, a little over halfway through training
Anonymous No.105751370 [Report] >>105751385
the anime girl is standing on a sandy beach with an ocean behind her and palm trees nearby. she is wearing the same swimsuit and holding a tropical drink. she is smiling.

pretty good considering anis is so thick you couldnt see her bottoms in the source image so the model had to guess.
Anonymous No.105751385 [Report] >>105751397
>>105751370
better booba, other one had artifacts
Anonymous No.105751389 [Report]
>>105751359
>First time some glowie spams a bunch of AI CP everywhere the free wheeling Is over.
ftfy
Anonymous No.105751395 [Report] >>105751458 >>105751480
>>105751265
The FusionX Recipe Workflow uses a different weight for the MPS Reward LoRA because of complaints about it fucking up faces in i2v. The GGUF and LoRA versions of FusionX haven't been re-merged with the lower weight.
Anonymous No.105751396 [Report]
>>105751359
well it's like roop/reactor. people could make deepfakes with it, but as long as people arent retarded and spam it everywhere, things will be fine.
Anonymous No.105751397 [Report]
>>105751359
Yup. I'd prefer normies get off the internet, than it getting locked down, which will happen anyways.
>>105751385
No she just has quad boobs because she is superior.
Anonymous No.105751398 [Report]
>>105751332
I drew some clothes over him, changed the prompt a little, hope it works
Anonymous No.105751406 [Report]
stay hydrated ldg!

the anime girl is standing on a sandy beach with an ocean behind her and palm trees nearby. she has large breasts and is wearing the same swimsuit and drinking a bottle of water. she is smiling.

emphasizing large breasts seems to keep them gacha-tier.
Anonymous No.105751429 [Report] >>105751450
the anime girl is standing on a sandy beach with an ocean behind her and palm trees nearby. she has large breasts and is wearing the same swimsuit and drinking a bottle of water. a blonde anime woman with small breasts nearby is looking down at the ground, dejected.

kek
Anonymous No.105751432 [Report]
>>105749511
Joycaption
Anonymous No.105751436 [Report]
Anonymous No.105751450 [Report] >>105751471
>>105751429
the blonde anime girl on the bottom right is holding a white sign that says "IT'S OVER" in scribbled font.

what a fun model. all off a swimsuit photo.
Anonymous No.105751458 [Report] >>105751489
>>105751395
Good to know. That being said, even using base Wan and loading the models in sequence with the weights used in the workflow still results in fucked up faces. Same issue with MPS turned off.
Maybe CausVid and AccVid are causing issues with Lightx2v and the NAG shit, even though Lightx2v is turned down to 0.6

>>105751359
All it takes is one malicious retard doing it to poison the well, or an especially spiteful retard pissed at AI to learn how the shit works to spam some celeb's twitter with genned porn.
Anonymous No.105751471 [Report] >>105751492
>>105751450
the anime girl on the left is holding two blue milk containers with the text "MILK" on the container, with both hands.
Anonymous No.105751480 [Report] >>105751488
>>105751395
>The FusionX Recipe Workflow
where? In the rentry?
Anonymous No.105751488 [Report] >>105751521
>>105751480
No, on the page for FusionX.
https://civitai.com/models/1690979
Anonymous No.105751489 [Report]
>>105751458
AI takes the surveillance state to its natural conclusion. Embedded hardware IDs in your DSLR, PC, phone, and all files you use are auto-tagged with who made them, modified them, when, and how. A digital ID to access the "official" internet, while the "unofficial" has as many taylor swift furry gangbang compilations you want.
Anonymous No.105751492 [Report]
>>105751471
remove the anime girl on the bottom right. the anime girl on the left turns away 180 degrees facing away from the camera.

actually impressive, wan could rotate stuff too
Anonymous No.105751521 [Report] >>105751540
>>105751488
so which lora actually caused face shift?
Anonymous No.105751537 [Report]
>>105748529
Prompt info is right there if that's what you're asking.
Anonymous No.105751539 [Report]
>>105751533
>>105751533
>>105751533
>>105751533
Anonymous No.105751540 [Report]
>>105751521
MPS Reward is generally regarded as the one that shifts faces. It's supposed to do other "useful" shit with motion, but the drawback is the face thing.

That's not MY face issue, per-se, but for other anons it is.
Anonymous No.105753166 [Report]
>>105748241 (OP)
Degen thread.