Thread 106221281

322 posts 158 images /g/
Anonymous No.106221281 [Report] >>106221296 >>106222956 >>106225561
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106217746

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.1: https://rentry.org/wan21kjguide
2.2: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-HD/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
statler/waldorf No.106221296 [Report]
>>106221281 (OP)
>>106220631
ill take that $10 meow tho
Anonymous No.106221300 [Report] >>106221306
Blessed thread of frenship
Anonymous No.106221301 [Report] >>106221355 >>106221373 >>106221443 >>106221450 >>106221454 >>106223375 >>106223632
Reposting, digital camera lora


https://files.catbox.moe/hn8034.safetensors

Just prompt for photography and it should work.
statler/waldorf No.106221306 [Report] >>106221455
>>106221300
>>106217785
>>106217761
Anonymous No.106221355 [Report]
>>106221301
looks great, testing right now
Anonymous No.106221373 [Report]
>>106221301
Based anon, looks good
Anonymous No.106221423 [Report] >>106221430 >>106221536
How can I make it look more real? Where can it be improved?
Anonymous No.106221430 [Report]
>>106221423
Prompt "candid amateur photograph", never fails
Anonymous No.106221436 [Report]
Anonymous No.106221443 [Report] >>106221448
>>106221301
Anon, this is great! Using iterative pixel space upscale with 2-3 steps pushes in new details from the lora.
Anonymous No.106221448 [Report] >>106221468
>>106221443
Don't use Chroma Annealed
Anonymous No.106221450 [Report]
>>106221301
You should post it on Civitai as well
Anonymous No.106221454 [Report] >>106221458
>>106221301
Great let me just try this on 50 different chroma models to figure out which one its for
Anonymous No.106221455 [Report] >>106222104
>>106221306
You need to go back
Anonymous No.106221458 [Report]
>>106221454
Just use regular Chroma v50
Anonymous No.106221468 [Report] >>106221471
>>106221448
Probably doesn't matter much with lora. 49 might be the way to go.
Anonymous No.106221471 [Report]
>>106221468
crazy eyes. Avoid
Anonymous No.106221511 [Report]
>a static shot,
>the car turn lights are blinking,
>the girl is standing enjoying the wind as it moves and rattles her clothes and hair,
>gusty wind moves the grass and nature around,
>text appears slowly in the black bar at the bottom that says "WAN2.2 the best goon generator"
Anonymous No.106221525 [Report] >>106221527 >>106221528 >>106221532 >>106221577 >>106221732
Are you guys keeping track of Ace Step? The dev claimed on the server that they are aiming to beat Suno in the next releases
Anonymous No.106221527 [Report] >>106221544 >>106221560
>>106221525
Will they release the model for local use?
Anonymous No.106221528 [Report]
>>106221525
Aware? Yes. Keeping track? No. It really was more of a "Oh that's kind of neat" release for me. It wasn't good enough to really use for anything. Good to hear it's still progressing though, because it was close.
Anonymous No.106221532 [Report] >>106221560
>>106221525
>beat Suno

That's a low bar, and I sleep. Let me know when they plan on catching up to Udio.
Anonymous No.106221536 [Report]
>>106221423
Created using qwen btw
Anonymous No.106221544 [Report] >>106221561 >>106221602
>>106221527
Yes
Anonymous No.106221550 [Report]
>>106218880
literally just try it out, idk if it'll work on kijai's shit but it worked for me on native. I have sage and torch compile on my workflow.
Anonymous No.106221553 [Report]
Anonymous No.106221560 [Report] >>106221572 >>106221614 >>106223937
>>106221527
Yes, they already released 1.0

Looking at their roadmap, they are now at:
>Train and Release ACE-Step V1.5

So I would assume it is currently training. No idea how long it takes; we don't know how long the first model took since it was just released out of the blue, and this will undoubtedly be bigger.

>>106221532
Local Suno quality would be fantastic, also with local you can train loras or do full finetunes of the model with any music you want.
Anonymous No.106221561 [Report] >>106221602 >>106221618
>>106221544
God I hate the fact so much info is squirreled away in a discord.
Anonymous No.106221572 [Report]
>>106221560
I will keep my expectations low for now, but it's a good thing to have a local suno like model, nothing some finetuning and loras can't fix like you said.
Anonymous No.106221577 [Report] >>106221601 >>106221614
>>106221525
More context
Anonymous No.106221601 [Report]
>>106221577
Anonymous No.106221602 [Report] >>106221616 >>106221631
>>106221544
>>106221561
https://vocaroo.com/1bacNudadSF7
sounds like shit desu, still has the squelchy stability VAE. guess we're gonna have to wait even longer for a proper Udio competitor that isn't just another chinese pop rap slop dataset
Anonymous No.106221607 [Report]
Anonymous No.106221613 [Report] >>106222116
comfy should be dragged out on the street and shot
Anonymous No.106221614 [Report] >>106221638 >>106222022
>>106221577
>>106221560
Seems they're taking it seriously. Maybe we'll have an SD moment for audio soon.
Anonymous No.106221616 [Report]
>>106221602
They haven't finished training
Anonymous No.106221618 [Report] >>106221628
>>106221561
now imagine how many support threads of many products are on discord only, and will never help anyone like open forums before
discord was made to chat and video, not to be a fucking support forum
Anonymous No.106221620 [Report] >>106221637
Does Chroma still have SD1.5 anatomy?
Anonymous No.106221628 [Report] >>106221644
>>106221618
I unironically think it has ruined the internet. We have no idea how many solutions to problems and useful resources are locked up in some random discussion on some discord with dead invite links
Anonymous No.106221631 [Report] >>106221649
>>106221602
>proper Udio competitor
God I want that so much, especially as udio didn't change so much lately, they don't give a shit since they're de facto one of the best in this, no fire under their asses.
I don't hate their service but man I hate how censored it is, both for musical IP and profanities.
Anonymous No.106221635 [Report]
is there an inpaint version of chroma?
Anonymous No.106221637 [Report]
>>106221620
Yes.
You are about to be approached by people who will say literally anything to convince you otherwise, but the problems that plagued Chroma since its inception are still more or less present.
Anonymous No.106221638 [Report]
>>106221614
We won't have a "stable diffusion moment" for music gen until lora training for songs is actually a thing
I remember reading that the original Ace Step was garbage to train
Anonymous No.106221644 [Report] >>106221661
>>106221628
oh I agree with you, there are probably many cool AI channels not even in English nested there, with plenty cool tech and loras and ideas, and we'll never know
Anonymous No.106221649 [Report] >>106221678
>>106221631
>Udio
do we even know the size of their 130s model?
Anonymous No.106221660 [Report]
Anonymous No.106221661 [Report] >>106221709
>>106221644
I agree with you too. I was just venting frustration. Like if you know the discord exists and are in it, it's fine, but you can never be fully aware of every single discord.
Anonymous No.106221678 [Report] >>106221713
>>106221649
nta, no idea about the size, but those cloud song gen models feel they are something that could indeed run local on a simple 3090/4090, even if it has to be quantized
those labs are not big with dedicated clusters like google etc, yet their models generate audio kinda fast
Anonymous No.106221709 [Report] >>106221715 >>106221758
>>106221661
Problem is that every community is being funneled into discord. Asking a question there, even if you are a professional in some task, means you will face 10 answers from retards. Just like on steam forums.
It's a waste of time and dignity.
Anonymous No.106221713 [Report]
>>106221678
oh they need a few minutes to gen the 130s audio and they often go down and error out, so I suspect they're kinda big, but I have no reference point so maybe they're just "LLM light"
Anonymous No.106221715 [Report] >>106221720 >>106221724 >>106221805
>>106221709
Just like this place? Have you seen the /ldg/ discord? It's insane how much valuable information is in there.
Anonymous No.106221720 [Report]
>>106221715
no one outside of it will ever know, and that's the issue
Anonymous No.106221724 [Report] >>106221737
>>106221715
>Wait, since when do we have a discord?
Anonymous No.106221732 [Report] >>106221747
>>106221525
I'm not interested in music gen but I am interested in vocal synthesis
Particularly, speech editing while retaining prosody
Anonymous No.106221737 [Report] >>106221768
>>106221724
Sorry. I've said too much.
Anonymous No.106221747 [Report] >>106221958
>>106221732
Oh but they do claim to support lyric editing which is basically what I need.
Has anyone tried it? I need to edit a Costanza video
Anonymous No.106221758 [Report] >>106222650
>>106221709
at least steam forums, and forums in general, enforce a format that slows down bullshit a little
using chat format for support is so bad, people just write whatever shit without thinking, half the questions don't even get answers and are drowned out in the chatter if the support system wasn't thought out, and any already answered question cannot even be easily searched, even worse with discord's horrible fuzzy search
Anonymous No.106221760 [Report] >>106221766 >>106221767 >>106221775 >>106221786 >>106221793 >>106221808 >>106221868 >>106221972
https://www.reddit.com/r/StableDiffusion/comments/1mn818x/nvidia_dynamo_for_wan_is_magic/

VRAM is solved???
Anonymous No.106221766 [Report]
>>106221760
I'll wait for anon to actually confirm it's good. The OP of that thread reeks of ESL and their reasoning is akin to explaining everything away with advanced alien tech.
Anonymous No.106221767 [Report]
>>106221760
if true this is amazing
I have plenty ram
Anonymous No.106221768 [Report] >>106222126
>>106221737
pretty please?
Anonymous No.106221775 [Report]
>>106221760
so... block swapping?
Anonymous No.106221786 [Report] >>106222029
>>106221760
I'm not opening that. give me tldr
Anonymous No.106221793 [Report] >>106221799
>>106221760
huh, is this the reason I can run the 15gb Q8s on a 3060 12gb for no speed penalty?
Anonymous No.106221799 [Report]
>>106221793
gguf is irrelevant to nvidia, so no
Anonymous No.106221805 [Report]
>>106221715
sounds nice, i'll just give discord my phone number, address, and social security number so i can read it all
Anonymous No.106221808 [Report]
>>106221760
Not compatible with kijai's own video model output.
Great start.
Anonymous No.106221868 [Report] >>106221888 >>106221891 >>106221936
Should I trust what this guy says?
>>106221760
Anonymous No.106221888 [Report]
>>106221868
rip
Anonymous No.106221891 [Report]
>>106221868
that's kijai the god so i'd trust his word over anyone else
Anonymous No.106221897 [Report] >>106221902 >>106222493
Absolutely brutal mogging.
Anonymous No.106221902 [Report] >>106221909 >>106222584
>>106221897
Damn it. Posted the image again.
Anonymous No.106221909 [Report] >>106221916
>>106221902
both have a dick
Anonymous No.106221916 [Report]
>>106221909
Sorry. Only the shorter one does.
Anonymous No.106221936 [Report] >>106222488
>>106221868
So its complete bullshit then? kek. But seriously though, are there any nodes for gguf that can allow for more vram room/extended length gens/etc? I can get to 8 sec gens with the wan2.2 all in one but any further and I OOM
Anonymous No.106221958 [Report]
>>106221747
Damn that was awful
Anonymous No.106221972 [Report]
>>106221760
Some retard who doesn't understand what he is talking about.

Torch compile has been around for a LONG time, like 2.0 or something, and vram offloading has nothing to do with torch compile, and it's been done automatically in Comfy, Forge, trainers etc for a long time as well.

Also has nothing to do with Nvidia Dynamo
Anonymous No.106222022 [Report]
>>106221614
The 1.0 Ace Step model was easily the best local music model yet, but still like Suno 2.0 in quality which just isn't near good enough.

So here's hoping they deliver with 1.5, being able to train / finetune on your favorite artists / bands in ~Suno 4.5 quality would be awesome
Anonymous No.106222029 [Report]
>>106221786
some retarded redditor listed a bunch of things everyone is already doing and thinks its new information.
Anonymous No.106222055 [Report] >>106222063 >>106222070 >>106222081 >>106222101
how long would it take for kontext to colourise every frame of a 80~ minute movie
Anonymous No.106222063 [Report]
>>106222055
Insanely long and it would look awful.
Anonymous No.106222070 [Report]
>>106222055
80 x 60, 4800 seconds
24 frames to a second, 115,200
average 2 minutes per frame...
about 23 weeks
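
The arithmetic above can be checked with a short sketch. The 2 minutes per frame is the poster's rough average, not a benchmark, and the function name is made up for illustration.

```python
def colorize_estimate(runtime_min, fps, sec_per_frame):
    """Return (frame_count, total_weeks) for processing every frame once."""
    frames = runtime_min * 60 * fps            # minutes -> seconds -> frames
    total_seconds = frames * sec_per_frame     # one pass per frame, no batching
    weeks = total_seconds / (60 * 60 * 24 * 7)
    return frames, weeks


# 80-minute movie, 24 fps, assumed 120 s per frame
frames, weeks = colorize_estimate(runtime_min=80, fps=24, sec_per_frame=120)
print(frames, round(weeks, 1))
```

Halving the per-frame time halves the total, so even at 1 minute per frame this is still a bit over 11 weeks of nonstop generation.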
Anonymous No.106222081 [Report] >>106222151
>>106222055
Anonymous No.106222101 [Report] >>106222113
>>106222055
Colorize one frame using Kontext, take the time it took and multiply against the frames per second of the movie, then multiply that against the number of seconds in the movie which in this case is 4800 (60 * 80)

Of course it will still look like shit since there's no guaranteed consistency in the coloring.
statler/waldorf No.106222104 [Report]
>>106221455
to your moms? shes had enough believe me and i would know
Anonymous No.106222113 [Report]
>>106222101
>Colorize one frame
>Use it as the reference in vace
>Use the rest of the clip as the target output with a depth and canny preprocessor
>Do this for the next several thousand or so 5~ second clips.

Still an absolutely pointless endeavor but slightly more consistency. But I think there are better ways to colorize footage than using AI desu.
Anonymous No.106222116 [Report]
>>106221613
Just press update, it’ll be okay anon
Anonymous No.106222126 [Report] >>106222135 >>106222138
>>106221768
https://discord.gg/gK4hw3Dm
Anonymous No.106222135 [Report] >>106222335
>>106222126
my mom told me not to trust a board of strangers
statler/waldorf No.106222138 [Report]
>>106222126
>discord
maybe everyone in this general can return to discord and /g/ will be free from sloppa
Anonymous No.106222148 [Report] >>106222154
what model should black forest lab launch to beat qwen?
Anonymous No.106222151 [Report]
>>106222081
rofl x)
Anonymous No.106222154 [Report]
>>106222148
Flux Safe Dev. I can take any image input and make it safe.
Anonymous No.106222162 [Report] >>106222182 >>106222208
speaking of torch compile, would it be pointless to use if im unloading the model + clearing vram after every run?
Anonymous No.106222168 [Report]
Anonymous No.106222180 [Report] >>106222201
Another brutal mogging
Anonymous No.106222182 [Report]
>>106222162
depends.
there are fast startup compile methods
Anonymous No.106222201 [Report]
>>106222180
I'd still rather the short one, cute
Anonymous No.106222206 [Report] >>106222222
Anonymous No.106222208 [Report]
>>106222162
Ask AI.
https://playground.perplexity.ai/

>No, using torch.compile is generally not useful in your workflow if you unload the model and clear VRAM after every generation. Here's why:
tl;dr, Yes, it is pointless.
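
A rough way to see why: compiling only pays off once the per-run speedup has covered the compile time. The sketch below assumes the compile cost is paid again every time the model is reloaded (compile caches may reduce that), and every number in it is a placeholder, not a measurement.

```python
import math


def net_seconds_saved(compile_s, eager_run_s, compiled_run_s, runs_per_load):
    """Time saved per model load: per-run speedup times runs, minus one compile."""
    return runs_per_load * (eager_run_s - compiled_run_s) - compile_s


def break_even_runs(compile_s, eager_run_s, compiled_run_s):
    """Smallest number of runs per load at which compiling stops costing time."""
    saving = eager_run_s - compiled_run_s
    if saving <= 0:
        return None  # compiled run is not faster, so it never pays off
    return math.ceil(compile_s / saving)


# Hypothetical numbers: 90 s compile, 300 s eager run, 250 s compiled run
print(net_seconds_saved(90, 300, 250, runs_per_load=1))  # negative means time lost
print(break_even_runs(90, 300, 250))
```

With one run per load the compile step is pure overhead under these assumptions; it only starts helping if the model stays loaded for several gens.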
Anonymous No.106222222 [Report] >>106222255 >>106224865
>>106222206
Makes me want to gen my own sukeban deka show, but now every female is topless due to a new dress code being enforced
Anonymous No.106222226 [Report] >>106222251 >>106222277
why am I not as excited as the sd 1.5 days?
Anonymous No.106222230 [Report]
Filter Chroma shilling wot this

(?i)(chroma|\bv([1-9]|[1-4][0-9]|50)\b)
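
For anyone unsure what that pattern catches, here is a quick check using Python's re module as a stand-in. The filter engine you actually paste it into may handle flags or word boundaries differently, and the sample strings are made up.

```python
import re

# Pattern copied from the post above; (?i) makes it case-insensitive.
FILTER = re.compile(r"(?i)(chroma|\bv([1-9]|[1-4][0-9]|50)\b)")


def is_filtered(text):
    """True if the text mentions chroma or a version tag from v1 to v50."""
    return FILTER.search(text) is not None


samples = [
    "Just use regular Chroma v50",
    "49 might be the way to go",
    "try v51",
    "wan 2.2 is fine",
]
for s in samples:
    print(is_filtered(s), "|", s)
```

Note that a bare version number without the leading "v" slips through, since the numeric branch requires the "v" prefix.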
Anonymous No.106222242 [Report]
I love enforcing dress codes
Anonymous No.106222251 [Report] >>106222276 >>106222774
>>106222226
As you grow old everything is less exciting. This continues until you eventually welcome death.
Anonymous No.106222255 [Report] >>106222289
>>106222222
Wasted
Anonymous No.106222271 [Report]
Is it possible to make a video type that supports transparency in comfyui? Like have the subject move against a high contrast background, use background remover to isolate the subject in each frame then string them all together into a codec that supports transparency?
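
One possible route, sketched outside ComfyUI: save the background-removed frames as RGBA PNGs and let ffmpeg pack them into a WebM with VP9, which can carry an alpha channel via the yuva420p pixel format. The frame pattern, frame rate and output name below are placeholders, and the helper only builds the command, it does not run it.

```python
def vp9_alpha_command(frame_pattern, fps, output_path):
    """Build an ffmpeg argument list that encodes RGBA frames to VP9 with alpha."""
    return [
        "ffmpeg",
        "-framerate", str(fps),   # input frame rate of the image sequence
        "-i", frame_pattern,      # e.g. numbered PNGs with transparency
        "-c:v", "libvpx-vp9",     # VP9 encoder
        "-pix_fmt", "yuva420p",   # pixel format with an alpha plane
        output_path,
    ]


cmd = vp9_alpha_command("frame_%05d.png", 16, "subject_alpha.webm")
print(" ".join(cmd))
```

Whether the transparency shows up on playback depends on the player or editor; not everything honors VP9 alpha.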
Anonymous No.106222276 [Report] >>106222286 >>106222427
>>106222251
Speak for yourself. I was super excited for Wan2.2 and am pleased with the results. I look forward to how AI progresses. I'm also 35.
Anonymous No.106222277 [Report]
>>106222226
Because your potato pc can't run anything better than sd 1.5
Anonymous No.106222286 [Report] >>106222303
>>106222276
I'm excited too, but I remember being much more excited for things in the past. I remember running to the video game store every day to ask if Halo 2 had leaked yet because I was so excited and the guy at the store got mad at me because I wouldn't stop asking. I don't feel that kind of anticipation any more.
Anonymous No.106222289 [Report]
>>106222255
So was your mom when you were conceived, and every day since then

Bazinga!
Anonymous No.106222303 [Report] >>106222323
>>106222286
you are expecting to have the same enthusiasm as a child? are you serious anon
Anonymous No.106222323 [Report] >>106222350 >>106222363
>>106222303
Of course I am. There is clearly a progressing pattern of being less excited for things as I get older. You're confusing me acting on my excitement like a child with being excited in general.
Anonymous No.106222335 [Report]
>>106222135
Not asking you to trust me
Asking you to post gen
Anonymous No.106222350 [Report] >>106222385
>>106222323
>You're confusing me acting on my excitement like a child and being excited in general.
because you used an example of being excited as a child to relate to how you are less excited about things now. adult emotions are obviously more complex and nuanced. that does not mean you are 'less excited'. it's just represented differently.

or perhaps you have mild depression.
Anonymous No.106222357 [Report] >>106222410 >>106222491 >>106222534
Anonymous No.106222363 [Report]
>>106222323
Nta but the dopamine rush is gonna be different if only since you already know what to expect from an 'ai picture generator'
Driving a car for the first time is exciting, even if it's a clunker
Driving in your 4th beamer (now in orange) will do less for you
Anonymous No.106222385 [Report] >>106222418
>>106222350
>adult emotions are obviously more complex and nuanced
Kek, here are manchildren, and you are one of them. Don't tag yourself as complex, Mr. Sloppa.
Anonymous No.106222410 [Report] >>106222427
>>106222357
AI was made for goon, and I expect to have my personal and local goon model in ten years if I still don't have erectile dysfunction.
Anonymous No.106222418 [Report]
>>106222385
well, anon is still posting and talking about ai, so there's clearly something still there even if anon doesn't see it as excitement
Anonymous No.106222427 [Report] >>106222463
>>106222276
>35
How do you cope with that?
>>106222410
Im that anon, same age.
Anonymous No.106222463 [Report] >>106222503
>>106222427
You shouldn't be getting ED until you're like 65+. If I somehow get it I guess I'd have to rely on viagra or something.

right now I have no problem getting hard as a rock and staying hard for hours.
Anonymous No.106222474 [Report] >>106222595
>>106220163
Motion comes from high noise, not lora problem.
>>106220717
Use the 2.1 lora AND lcm sampler. Any other sampler will oversaturate and give weird details.
Anonymous No.106222488 [Report]
>>106221936
b l o c k s w a p

only works with kijai nodes. or just use the virtual ram option with the gguf loader r e t a r d
Anonymous No.106222491 [Report] >>106222518 >>106222522 >>106222534 >>106222570
>>106222357
With wan 2.2 t2v light LoRA

Same prompt without:

Do not use the 2.2 light LoRA it is hot garbage, there is no defending it.
Anonymous No.106222493 [Report]
>>106221897
i did not expect my tape measure prompt to work at all. it even drags on the ground
Anonymous No.106222503 [Report]
>>106222463
I'm scared anon, scared that my Asuka android waifu will come when I'm old.
Anonymous No.106222518 [Report] >>106222523
>>106222491
Why don't she start fingering her and have sex right there? Are you gay?
Anonymous No.106222522 [Report] >>106222543
>>106222491
im confused. this one is using light lora?
Anonymous No.106222523 [Report]
>>106222518
Erm, why do you assume it's sexual?
Anonymous No.106222534 [Report] >>106222543 >>106222576 >>106222798
>>106222491
Without. Or that is to say, using the 2.1 version only.

>>106222357
Is using the r*ddit mix of both LoRAs, but it's obvious at this point that the training of the newer LoRA went wrong in some way and should be avoided. It blows out the contrast and kills motion.
Anonymous No.106222543 [Report]
>>106222522
>>106222534
Meant to reply to.
Anonymous No.106222570 [Report]
>>106222491
bodyslam
BODYSLAM HER NOW
Anonymous No.106222576 [Report]
>>106222534
what do you mean anon? it was made for 2.2, clearly it's the one you should use. kontext autist said so
Anonymous No.106222584 [Report] >>106222606 >>106222610
>>106221902
Same prompt without the 2.2 LoRA.
It uhh... might have taken the muscle part a bit too far.
Anonymous No.106222595 [Report]
>>106222474
Heun works, but it's like the single most performance demanding sampler.
Anonymous No.106222601 [Report] >>106222622 >>106222668
so qwen image has edit and umpteen other options, but for some reason i only see t2i.
what's up?
Anonymous No.106222606 [Report] >>106222609
>>106222584
this is what happens if a woman even looks at those pink 2lb dumbells
Anonymous No.106222609 [Report]
>>106222606
Literally the weakest woman in curves.
Anonymous No.106222610 [Report] >>106222625 >>106222660
>>106222584
Is your prompt using emphasis? like (muscular girl):2.5? If so you need to adjust it because it doesnt need those high values since its not using light.

anyway, yeah this looks WAY better. It's like a fucking movie scene. I fully believe people who shill light are only doing shitty anime gens or are blind.
Anonymous No.106222617 [Report] >>106222630 >>106222633 >>106222640 >>106224211
I'm trying to see how well Wan sticks to my prompts, so I'm doing multiple runs with the same settings/seeds but changing the prompts.

I tried with this image of Hu Tao that I found.

If I prompt "She looks to her left" she turns frame left, which is her right. Vice versa with "She looks to her right"

If I prompt "Another girl enters the video from the right." then nothing happens and I get a video of Hu Tao just kinda sitting there and moving slowly. If I add "She sits down on the bed" then another girl enters from frame right and sits down. If I change right to left then she enters from the left and sits down.

Now here's the part I wanted to test. If I say a character does an action with one of her hands, how does Wan understand it? For example, If I say "She makes a peace sign with her left/right hand" does that mean "her left/right" or our "left/right."

Well what I ended up getting was a bunch of videos of Hu Tao making a peace sign with her right (our left) hand, regardless of what I wrote in the prompt. Finally I got a video of her making a peace sign with her left (our right) hand if instead of specifying "left/right hand" I wrote in the prompt: "The anime style girl makes a peace sign with her hand that is on the right side of the frame. Her hand on the right side of the frame is in front of her making a peace sign. Her hand on the left side of the frame is on the bed."

So my main takeaways are:
1. Prompt adherence seems to improve with more detail, not in the sense that the movement gets more detailed, but that adding details makes the movement happen at all, as with the prompt about a girl entering the video.
2. Directions (as in left/right) are ambiguous and may need more detailed prompting. Frame-based directions seem to produce the least ambiguity. Without frame-relative directions, no amount of prompting produced a video of her using her left (frame right) hand unless I also changed the details of the movement in the prompt.
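
The frame-relative trick from takeaway 2 can be sketched as a tiny prompt helper. It assumes the character is facing the camera, so her left lands on the viewer's right; the function names and the sentence template are made up for illustration and are not anything Wan requires.

```python
def frame_side(character_side, facing_camera=True):
    """Convert the character's own left/right into the side of the frame."""
    side = character_side.lower()
    if side not in ("left", "right"):
        raise ValueError("side must be 'left' or 'right'")
    if not facing_camera:
        return side  # facing away from the camera, the sides line up with the frame
    return "right" if side == "left" else "left"


def hand_prompt(character_side, action, facing_camera=True):
    """Describe a hand by where it sits in the frame rather than by her left/right."""
    side = frame_side(character_side, facing_camera)
    return f"Her hand on the {side} side of the frame {action}."


print(hand_prompt("left", "is in front of her making a peace sign"))
print(hand_prompt("right", "is on the bed"))
```

Describing both hands this way mirrors the prompt that finally worked in the test above.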
Anonymous No.106222622 [Report] >>106222665
>>106222601
because you are incapable of reading or finding information. sorry anon, it's not looking good for your future.
Anonymous No.106222625 [Report] >>106222672 >>106222681 >>106222894
>>106222610
Here it is without the emphases on statuesque and muscular.
But yeah. The 2.2 light lora absolutely fucks outputs and should not be used under any circumstances. There is no usecase.
Anonymous No.106222630 [Report]
>>106222617
>mfw no time for genshin because ai gen has consumed my life
i am sorry dawei i have let you down
Anonymous No.106222633 [Report] >>106222641
>>106222617
>If I prompt "She looks to her left" she turns frame left, which is her right. Vice versa with "She looks to her right"
I've had good success with 'stage right' and 'stage left', right and left are very ambiguous.
Anonymous No.106222638 [Report]
Anonymous No.106222640 [Report]
>>106222617
A note about point 2, I was able to generate videos with her making a peace sign with her left (frame right) hand, but it required me to change the prompt by adding in movements for her other hand. Also even though she changed hands, I could not swap her hands by simply swapping left and right in the prompt. With the prompt with frame relative directions, swapping left and right swapped her hands.
Anonymous No.106222641 [Report] >>106222645
>>106222633
nice one snagglepuss
Anonymous No.106222645 [Report]
>>106222641
Huh? Snagglepuss?
Anonymous No.106222650 [Report] >>106222655
>>106221758
This is what I meant. Sorry if my English was bad or polarized.
4chan can be great for learning but you will personally need to learn how to filter out the signal to noise.
And I think that after some point, there's nothing you can learn from this place. Image gen was something, llms were one thing but once you actually know them... signal to noise becomes too inefficient.
Anonymous No.106222655 [Report]
>>106222650
For US bots - I'm not "Indian" or "jeet" - I'm scandinavian if that matters so much.
Anonymous No.106222660 [Report] >>106222671 >>106222681
>>106222610
i understand that he is using lightx2v 2.1
Anonymous No.106222665 [Report] >>106222687
>>106222622
being a leecher is a way of life.
If no one does qwen edit, it won't be accessible yet. the community dynamic is strong enough not to miss something like that. relevant to know why? not necessarily.
so why bother if I can still swim ahead with the mob?
imagine investing more time than necessary to generate boob slop. kek
Anonymous No.106222668 [Report]
>>106222601
they said it will be released soon in the official readme
Anonymous No.106222671 [Report] >>106222729
>>106222660
Correct. I'm using light 2.1. I'm also running the low noise using Heun, but I don't know if that really matters at this point aside from a hit in performance.
Anonymous No.106222672 [Report] >>106222683
>>106222625
it already takes 250s even with the light lora.
i can't justify waiting so long.

if i lift weights while i wait on gens i'll get far too healthy.
Anonymous No.106222674 [Report] >>106222687
>click unload models button in comfy
>click free model and node cache button
>ram doesn't clear
epic
Anonymous No.106222681 [Report] >>106222696
>>106222660
Is he?
Can you confirm? >>106222625
Anonymous No.106222683 [Report] >>106222695
>>106222672
nono. I am using light. Just not the new one. It's trash. Strength of 3 on the high one, .8 on the low one.
Anonymous No.106222687 [Report]
>>106222665
you are dirt.

>>106222674
only clears vram. for some really retarded reason it never clears sys ram.
Anonymous No.106222695 [Report] >>106222710
>>106222683
Oh. But with 2/2 steps still or higher?
Anonymous No.106222696 [Report]
>>106222681
Yes, light 2.1.
No light 2.2.

I'd shill for no light at all but I only have so many years left to live.
Anonymous No.106222710 [Report] >>106222740
>>106222695
4 steps for each pass. Using Heun on the low noise.
Anonymous No.106222719 [Report]
Anonymous No.106222726 [Report]
I'm actually wondering how many steps the low noise pass actually needs to be decent without light at all.
Anonymous No.106222729 [Report] >>106222739
>>106222671
>Heun
where do i get that?
Anonymous No.106222739 [Report] >>106222800
>>106222729
It's a sampler, same pulldown tab as where lcm is. Just be warned it's slow but for good reason.
Anonymous No.106222740 [Report] >>106222767
>>106222710
what. why heun. sorry but this sounds like bullshit.
Anonymous No.106222767 [Report] >>106222779
>>106222740
I have zero reasoning to back it up. I just know heun has higher fidelity outputs at a performance cost. There is no reason to take my word for anything.
Anonymous No.106222774 [Report] >>106223355
>>106222251
WRONG. I hated flux but discovered flux krea, its exactly what I wanted
Anonymous No.106222779 [Report]
>>106222767
no, it's fine, I know it's slower but first reaction was "surely that deepfries the output". guess i'll just stfu and try it.
Anonymous No.106222796 [Report] >>106222803 >>106222815
Ok quick comparison.

Low noise light LoRA 0.8 strength 4 steps using Heun sampler.

I'll post without the LoRA in the next reply
Anonymous No.106222798 [Report] >>106222812
>>106222534
do you have a link to the r*ddit thread or the exact lora mix?
Anonymous No.106222800 [Report] >>106222812
>>106222739
kijai's sampler doesn't have it. also they're under "scheduler" for some reason
Anonymous No.106222803 [Report] >>106222815 >>106222826 >>106222827 >>106222839 >>106222870
>>106222796

4 steps using Heun sampler. Light LoRA disabled completely.
Anonymous No.106222812 [Report]
>>106222798
nta but I think he means this thread:
https://www.reddit.com/r/StableDiffusion/comments/1mkc6xf/psa_with_wan_22_combine_the_new_light_22_v2i/

>>106222800
forgot pic
Anonymous No.106222815 [Report] >>106222826
>>106222803
>>106222796

Looking at both I'd say with the LoRA was the better result, it more clearly shows she's sweaty.
Anonymous No.106222823 [Report]
don't smoke, it's not big and it's not clever
Anonymous No.106222826 [Report] >>106222846
>>106222803
>>106222815
without the lora you'd have to gen at like 20-30 steps
Anonymous No.106222827 [Report] >>106222846 >>106222846
>>106222803
4steps without the lora look that good still with heun? that's wild.
Anonymous No.106222839 [Report] >>106222855
>>106222803
What scheduler are you using with heun, simple?
Anonymous No.106222846 [Report]
>>106222827
Yeah it's basically 8 steps in practice.
>>106222826
Yeah I think looking at the results though there isn't much downside to using the LoRA for the low pass.
>>106222827
Yep. Gonna try at strength 1 next and then I'll try the low noise with Heun without the LoRA and see what happens.
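
On the "basically 8 steps" point: Heun is a second-order method that calls the model twice per step (an Euler predictor plus a corrector), so 4 Heun steps cost roughly 8 model evaluations. A minimal sketch on a toy ODE (dy/dt = -y), not the actual ComfyUI sampler; `integrate` and `decay` are made-up names for illustration:

```python
import math

def integrate(f, y0, t0, t1, steps, method="euler"):
    """Integrate dy/dt = f(t, y) from t0 to t1; return (y, number of f calls)."""
    h = (t1 - t0) / steps
    t, y, calls = t0, y0, 0
    for _ in range(steps):
        k1 = f(t, y)              # slope at the start of the step
        calls += 1
        if method == "heun":
            k2 = f(t + h, y + h * k1)   # slope at the Euler-predicted endpoint
            calls += 1
            y = y + h * 0.5 * (k1 + k2)  # average the two slopes
        else:
            y = y + h * k1        # plain Euler step
        t += h
    return y, calls

def decay(t, y):
    # Toy stand-in for "the model": exact solution is exp(-t)
    return -y

exact = math.exp(-1.0)
y_euler, euler_calls = integrate(decay, 1.0, 0.0, 1.0, 4, "euler")
y_heun, heun_calls = integrate(decay, 1.0, 0.0, 1.0, 4, "heun")
print("euler:", euler_calls, "calls, error", abs(y_euler - exact))
print("heun: ", heun_calls, "calls, error", abs(y_heun - exact))
```

Same step count, twice the calls, noticeably smaller error on this toy problem. Real sampler implementations may skip the corrector on the final step, so the actual count can be slightly under 2x.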
Anonymous No.106222851 [Report] >>106222900
how much faster is wan on 5090 compared to 3090? thinking of upgrading
Anonymous No.106222855 [Report] >>106222872
>>106222839
Yep, Heun simple.
Anonymous No.106222870 [Report] >>106222946 >>106223116
>>106222803
Light LoRA strength 1 on low noise.

Doing no LoRA 4 steps Heun on high and 1 strength LoRA 4 steps Heun on low next.
Anonymous No.106222872 [Report] >>106222888
>>106222855
What about the ModelSamplingSD3 value when using the lora? Also this is I2V right?

So you're using lcm+simple for high noise, and heun+simple for low noise, lightx2v i2v lora at 0.8 strength for both high/low noise?

Just trying to make sure I have my setup similar to see if I get similar results.
Anonymous No.106222888 [Report]
>>106222872
That's correct. I have model sampling at 6.00
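
For context on what that 6.00 does: as far as I understand it, the shift value remaps every sigma in the schedule toward the high-noise end. A sketch assuming the usual flow-matching shift formula; `shift_sigma` is a hypothetical helper name, so check the ComfyUI source before relying on it:

```python
def shift_sigma(sigma, shift):
    """Remap a sigma in [0, 1]; shift > 1 pushes values toward 1 (more noise)."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)

# An evenly spaced 4-step schedule before and after shift 6.0
schedule = [1.0, 0.75, 0.5, 0.25, 0.0]
shifted = [shift_sigma(s, 6.0) for s in schedule]

for before, after in zip(schedule, shifted):
    print(f"{before:.2f} -> {after:.4f}")
```

The endpoints stay at 1 and 0, but the middle of the schedule moves up, so more of the step budget is spent at high noise. A shift of 1.0 leaves the schedule unchanged.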
Anonymous No.106222894 [Report] >>106222900 >>106222904 >>106222916
>>106222625
light 2.2 is so fucking ass, what a disappointment
Anonymous No.106222900 [Report]
>>106222851
should be close to 70% faster, maybe 50%ish

>>106222894
it's insane how they fucked it up so badly.
Anonymous No.106222904 [Report]
>>106222894
I don't know why they even released it. It's actually trash.
Anonymous No.106222916 [Report] >>106222929 >>106223121
>>106222894
What's your CFG on 2.2? That blue glow looks like too high CFG.
Anonymous No.106222929 [Report]
>>106222916
both gens are 2 cfg on high, 1 on low, kijai workflow's default
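
For reference, a minimal sketch of the standard classifier-free guidance mix, using plain lists as stand-ins for the model's conditional and unconditional predictions (`cfg_combine` is a made-up name, not a ComfyUI function):

```python
def cfg_combine(cond, uncond, scale):
    """Standard CFG: start from uncond and push toward cond by `scale`."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]

cond = [1.0, 2.0, 3.0]     # prediction with the positive prompt
uncond = [0.5, 1.0, 1.5]   # prediction with the negative/empty prompt

at_one = cfg_combine(cond, uncond, 1.0)
at_two = cfg_combine(cond, uncond, 2.0)
print(at_one)
print(at_two)
```

At scale 1 the result is just the conditional prediction, which is why the negative pass can typically be skipped at cfg 1 and why those gens run faster. Anything above 1 extrapolates past the conditional prediction, which is where overcooked colours and glow tend to come from.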
Anonymous No.106222946 [Report] >>106222965 >>106223022
>>106222870
No LoRA on the high noise here. But I forgot to turn on CFG so I gotta do it again with 2.5 cfg. After that I'll try .8 then 1 strength.
Anonymous No.106222949 [Report] >>106222993
i know it's not i2v but i'll try this for t2v. 4/4, euler simple.
the reason i use fastwan lora too is.. i saw someone use it so that's pretty much it.
Anonymous No.106222956 [Report]
>>106221281 (OP)
>four pics in collage are from asian waifu footfrend
huh i wonder who the baker was
Anonymous No.106222965 [Report] >>106222971
>>106222946
cfg for the heun part right? i assume having cfg on for the high noise will deepfry it.
Anonymous No.106222971 [Report]
>>106222965
It's looking that way.
Anonymous No.106222993 [Report]
>>106222949
anon here
this was garbage. slowmotion and completely ignores prompted motion "the camera pans around"
Anonymous No.106223022 [Report] >>106223041 >>106223063 >>106223080
>>106222946
Ok that took a while. Here is LoRA off on high, 2.5 cfg, heun at 4 steps. A little better than no cfg maybe. But she is double fisting cigarettes. Way too long to be worth it.

Will try heun with LoRA at .8 strength no cfg 4 steps next.
Anonymous No.106223041 [Report] >>106223051
>>106223022
lmao she pulled a magician's trick and made it disapear with a flourish
Anonymous No.106223051 [Report] >>106223098
>>106223041
You ever been really drunk and accidentally scrunched a lit cigarette up in your hands?
Anonymous No.106223055 [Report]
Are you allowed to join Reform if you're not a muslim?
Anonymous No.106223063 [Report] >>106223064
>>106223022
shit takes so long to generate that it's hard to find out the ideal settings, and doesn't help that it can be very seed dependent too
Anonymous No.106223064 [Report] >>106223086
>>106223063
True. Every one of these gens might be specific to this seed and my settings will turn to shit. But I think I'm finding a good balance.
Anonymous No.106223072 [Report]
where is the heun sampler in Wanvideo Sampler, i don't get it?
Anonymous No.106223080 [Report] >>106223116
>>106223022
Okay, high noise using 2.1 light LoRA at 0.8 strength heun at 4 steps cfg 0.

Her hand is doing something weird and her skirt is coming undone from the wrong side. It's supposed to be open at the hip. Gonna try this again on lcm then crank it up to 1.
I might poke around at lower strengths to see if there is a good balance there.
Anonymous No.106223086 [Report]
>>106223064
I tried the same settings and although the quality is fine, the motion sucks.

Then again I was never a fan of using lightx2 at all.
Anonymous No.106223098 [Report]
lightx2v 2.1

>>106223051
no...
Anonymous No.106223105 [Report] >>106223115 >>106223145 >>106223172 >>106223235
i don't keep up with things very much but i see comfy shilling for models once in awhile
i saw they proclaimed this ace step as some sort of incredible advance for local music generation so i tried it out and it was a big pile of poo
comfy's getting paid to make those posts right?

in actually interesting music gen news the dadabots guys are going to publish a lora training method for stable audio
Anonymous No.106223115 [Report] >>106223123 >>106223133 >>106223142
>>106223105
not once have i seen comfy or anyone shill acestep.
acestep is "k" but definitely not sota
Anonymous No.106223116 [Report] >>106223144 >>106223182
>>106223080

Reverse progress. LCM at LoRa strength 0.8 turned her back into this >>106222870 woman with disastrous results.
Gonna do a switcharoo and try using heun for High and LCM for low.
Anonymous No.106223121 [Report] >>106223316 >>106223330
>>106222916
okay re-ran it with 1 cfg and that fixes some things, lightx2v still seems to be better, all same seeds.
Anonymous No.106223123 [Report]
>>106223115
poopoo peepee
Anonymous No.106223129 [Report]
>>106221343
https://files.catbox.moe/3pa9hp.mp4
Anonymous No.106223133 [Report] >>106223166
>>106223115
Pretty sure he was even talking about it in this general.
Anonymous No.106223142 [Report] >>106223166
>>106223115
Anonymous No.106223144 [Report] >>106223175
>>106223116
You're not using any other lora with light, that's why it seems like you're getting decent results.

I tried using NSFW loras with light and they always nuke the motion no matter what setting I use. It's not necessarily "slow motion", but rather more linear and less lively. Again, without light, you get much more motion fidelity.
Anonymous No.106223145 [Report]
>>106223105
>stable audio
lmao
Anonymous No.106223156 [Report]
Anonymous No.106223166 [Report]
>>106223142
>>106223133
that's wild. i take it back. i mean it makes sense he would post here, free advertising
Anonymous No.106223167 [Report] >>106223188 >>106223192 >>106223210
>try wan 2.2 i2v on 16gb vram
>can't run fp8, switch to q6
>high noise model runs successfully, but server crashes when loading low noise model
>add clean vram node in between high low noise model swapping
>got result
>run again, low noise model still there and trying to load high noise model causes server crashes
>add clean vram after save video
>crashes at the very end of the workflow
nice to be a vramlet
I enjoy the journey
Anonymous No.106223172 [Report]
>>106223105
>are going to do publish a lora training method for stable audio
kek, it could hardly make sound effects, and was the only thing it could do at all
Anonymous No.106223175 [Report] >>106223191 >>106223275
>>106223144
not that anon, but this is true. i use light with nsfw loras and it goes from "he thrusts his hips energetcally" to "he stands still with his dick out like some virgin retard"
Anonymous No.106223180 [Report]
for the record, i don't think it's bad for comfy to take bribes to promote models
get that bag while you can
in a zero-credibility world one should embrace selling out and trust no one
Anonymous No.106223182 [Report]
>>106223116
Heun on high with light at strength 3 and lcm on low at strength .8
Slightly better, but I think lcm is giving it that airbrushed look. She should be sweaty.
Anonymous No.106223188 [Report] >>106223214
>>106223167
this isn't a vramlet issue. comfy just be like that sometimes. what's your system memory?
Anonymous No.106223191 [Report] >>106223200
>>106223175
are you using the 2.2 nsfw lora? i get too much motion with this thing
Anonymous No.106223192 [Report] >>106223214
>>106223167
are you running with torch.compile? I can run q8 with 12gb vram with it with pretty much no speed loss.
Anonymous No.106223200 [Report] >>106223211
>>106223191
"the" 2.2 lora? i'm not sure which one you mean. i tried all the current loras that work with 2.2 as well as most 2.1 (which work fine)
Anonymous No.106223207 [Report]
I guess Chroma is okay at genning MILFs in questionable office attire... v50 now just looks like SDXL gens though, which is unfortunate.
Anonymous No.106223210 [Report]
>>106223167
I have same issue with 4070ti super
Anonymous No.106223211 [Report] >>106223237 >>106223348 >>106223408
>>106223200
https://civitai.com/models/1307155/wan-22-experimental-wan-general-nsfw-model
Anonymous No.106223214 [Report] >>106223240 >>106223247
>>106223188
32gb

>>106223192
not yet
Anonymous No.106223230 [Report]
raincoat stress test
Anonymous No.106223235 [Report] >>106223249
>>106223105
nigga he shills implementation
"my tool supports this"
I doubt he gives two shits about the models
Anonymous No.106223237 [Report] >>106223277
>>106223211
neat. can't say i've tried that yet. the only thing i don't like about wan loras is that their activation usually requires a specific sentence which is bothersome to have to remember for all poses.
Anonymous No.106223240 [Report] >>106223465
>>106223214
>32gb
Yeah, if you have 16gb vram you need to offload a lot when using 2.2, even with quantization, get 64gb system ram
Anonymous No.106223247 [Report] >>106223465
>>106223214
32gb.

You aren't a vramlet. Well, you are. But that's not your issue. You're a ramlet. Thankfully that's a cheap-ish fix.
Anonymous No.106223249 [Report]
>>106223235
he slides his own shill into the model announcement but he always says it's good when things are clearly shit
Anonymous No.106223261 [Report]
3 steps with heun on high and low with both LoRAs at 1.

Idk at this point. Heun makes better motion with the same settings but the output is wildly different to lcm. Heun for sure works well on the low noise for high fidelity outputs compared to the airbrushed look of lcm though.

I'm just gonna tool around for a while and report back if I find anything interesting.
Anonymous No.106223275 [Report]
>>106223175
yeah, with light lora the man fucks like an old man with arthritis.

without light she has subtle facial expression changes, her ass bounces with each thrust, the guy is alternating his hip movements a bit. it's just better.

fuck light.
Anonymous No.106223277 [Report] >>106223279 >>106223288
>>106223237
i'm not joking, my dudes ravaging these women in these gens. get it before it might be gone
Anonymous No.106223279 [Report]
>>106223277 (Me)
and i'm using light
Anonymous No.106223288 [Report] >>106224163
>>106223277
idk why it would get removed but can you post a catbox of a gen using light? you got me curious now
Anonymous No.106223291 [Report] >>106223435
How do I gen with Wan 2.2 with 32 GB of RAM? Genning high noise latent, retaining it, unloading high noise model, loading low noise model and finishing the job shouldn't be that hard, right? Is it just lack of code issue?
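
The sequence described here can be sketched as below. This is only the ordering logic with toy stand-ins: `load_high` and `load_low` are hypothetical loader callables, and whether a given UI actually frees memory between stages depends on its own model management.

```python
import gc

def two_stage_generate(load_high, load_low, latent, high_steps, low_steps):
    """Run the high-noise model, drop it, then run the low-noise model."""
    model = load_high()
    for _ in range(high_steps):
        latent = model(latent)
    # Drop the only reference before loading the next model so both are
    # never resident at once. With PyTorch on GPU you would also call
    # torch.cuda.empty_cache() here (not done in this toy version).
    del model
    gc.collect()

    model = load_low()
    for _ in range(low_steps):
        latent = model(latent)
    del model
    gc.collect()
    return latent

# Toy stand-ins: each "model" is just a function on a number.
events = []

def load_high():
    events.append("load high")
    return lambda x: x * 0.5

def load_low():
    events.append("load low")
    return lambda x: x - 1.0

result = two_stage_generate(load_high, load_low, 16.0, high_steps=2, low_steps=2)
print(events, result)
```

Only the latent is carried across the swap, so peak memory is one model plus the latent rather than both models.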
Anonymous No.106223316 [Report] >>106223437
>>106223121
how is lightx2v better there? It's fucking up the colours and the knife is not the knife she started with.
Anonymous No.106223330 [Report] >>106223335 >>106223357
>>106223121
just realised that the 2.1 light lora even corrected the knife blade wtf
Anonymous No.106223335 [Report]
>>106223330
The light 2.1 LoRA is just better.
Anonymous No.106223348 [Report] >>106223356 >>106223372
>>106223211
>the spamming bbcope tranny poster on civit is chinese
lmao.
Anonymous No.106223355 [Report] >>106223871
>>106222774
Anonymous No.106223356 [Report]
>>106223348
lol i'm cooking something related
Anonymous No.106223357 [Report]
>>106223330
it didn't correct it. It just changed it completely. That's not good.
Anonymous No.106223370 [Report] >>106223427
how do I add details with wan i2v high/low noise setup?
do I put the image in the first frame and gen a vid with only 1 frame? but this gives me the exact same image back
Anonymous No.106223372 [Report]
>>106223348
>spamming bbcope
the what?
Anonymous No.106223375 [Report]
>>106221301
Good stuff anon. Instantly made V50 better.
Anonymous No.106223408 [Report]
>>106223211
How do I make payment processors aware of this?
Anonymous No.106223427 [Report]
>>106223370
prompt...?
Anonymous No.106223435 [Report]
>>106223291
page file if you're ready to give up on life
Anonymous No.106223437 [Report]
>>106223316
i like the movement better, even with lightning 2.2 at cfg 1. While it does fix the weird glow and some of the clipping, the motion is still not as good as lightx2v 2.1. Hell, at the end of the fixed 2.2 gen the knife is clipping into the watermelon. Lightx2v 2.1 isn't perfect, but lightning 2.2 is just ass.
Anonymous No.106223465 [Report] >>106223492 >>106223502
>>106223240
>>106223247
I'm on ddr4. does it matter if I upgrade to ddr5?
Anonymous No.106223492 [Report]
>>106223465

Your motherboard has to support DDR5 in order to use it. Just get DDR4.
Anonymous No.106223502 [Report]
>>106223465
No, not really. The speed difference between ddr4 and ddr5 has no measurable impact when offloading. Just buy 32gb more of ddr4, and make sure the timings match the sticks you already have for best performance.
Anonymous No.106223532 [Report] >>106223539
kontext is still fun for edits

plus you can feed these into wan 2.2.
Anonymous No.106223539 [Report] >>106223547 >>106223565
>>106223532
Kontext feels like it was made by a guy with a dwarf fetish.
Anonymous No.106223547 [Report] >>106223569
>>106223539
because it was trained on openai (closedai) output
Anonymous No.106223565 [Report] >>106223706
>>106223539
since I swapped to the 2 image workflow (bypass 2nd image nodes if doing 1 image) it seems a LOT better. even full body stuff.

https://www.reddit.com/r/StableDiffusion/comments/1m5wpmv/flux_kontext_psa_you_can_load_multiple_images/

this one, also good for making pepes
Anonymous No.106223569 [Report] >>106223582
>>106223547
wdym? They released the best open source AI to date a few days ago?
Anonymous No.106223582 [Report]
>>106223569
qwen is DPO'd to absolute shit. you have to work around it by feeding it more noise
Anonymous No.106223632 [Report]
>>106221301
tried it, pc almost exploded
Anonymous No.106223697 [Report] >>106223716 >>106223726 >>106223813
>ComfyUI
>UI is not comfy at all, not even close
What did he mean by this?
Anonymous No.106223706 [Report] >>106223778
>>106223565
the man is shaking hands with the pink hair anime girl. keep his expression the same.

behold, animebrah
Anonymous No.106223716 [Report]
>>106223697
Comfyui felt very uncomfy to me as well. Now when I use any other UI I'm just frustrated on how limited they are.
Anonymous No.106223726 [Report]
>>106223697
The cake is a lie
Anonymous No.106223778 [Report]
>>106223706
Anonymous No.106223813 [Report] >>106223873
>>106223697
houdini and nuke are the only node based programs that have great node-feel
Anonymous No.106223867 [Report]
>The girl possibly has an IQ over 90 and looks like someone who listens attentively.
Looks like it's a good idea to check tagger results :D I wonder how it works as prompt
Anonymous No.106223871 [Report]
>>106223355
kek, get'em old man!
Anonymous No.106223873 [Report]
>>106223813
The point of a node editor is to be better than having to write code. Comfy doesn't follow DRY principles with node setups, which is why it feels like dogshit.
Anonymous No.106223875 [Report] >>106223897 >>106223916
are there any models that can generate non contemporary scenes? like a medieval village or something. is it a prompting issue? i find that when i try to prompt too much then the image is really blurry and weird, but a regular living room looks good.
Anonymous No.106223880 [Report]
1oldgirl
Anonymous No.106223897 [Report] >>106223921
>>106223875
sdxl can do this, even 1.5 and 1.4 can do this
Anonymous No.106223916 [Report] >>106223945
>>106223875
What model are you using that *cant* do medieval scenes?
Anonymous No.106223921 [Report] >>106223942
>>106223897
im using a porn merge xl checkpoint, maybe thats why. is there an all purpose realistic checkpoint thats the best?
Anonymous No.106223937 [Report] >>106224026
>>106221560
> also with local you can train loras or do full finetunes of the model with any music you want
Just like with mmaudio/thinksound? And with h100 in the pc?
Anonymous No.106223942 [Report]
>>106223921
xl porn merges do buildings and interiors pretty well. If you have hardware flux/chroma/qwen/wan can probably oneshot whatever you want as long as you have sane prompt
Anonymous No.106223945 [Report]
>>106223916
any super photorealistic model will struggle with things there arent actual photos of. I cant prompt for an elf without it being a girl in a cheap halloween costume with plastic elf ears.
Anonymous No.106223957 [Report]
>have different workflows that have the exact same fucking models
>switch to 2nd from 1st workflow and it loads ALL of the fucking models again

WHY! Is there an option anywhere for this to not unload when I fire up a different workflow?
Anonymous No.106223992 [Report] >>106224039 >>106224284
how the fuck do i use this shit aaaaaa
Anonymous No.106224004 [Report]
Anonymous No.106224007 [Report]
wan doesnt seem to like trying to make people eat properly
Anonymous No.106224026 [Report]
>>106223937
The 1.0 Ace Step model was finetunable on 16gb vram, and would be much lower with effective block swapping and/or quantization.

On my 3090 it took 5 seconds to generate 45 seconds of audio using it, audio is not very demanding.

The new model will be larger, but easily useable and trainable on local.
Anonymous No.106224039 [Report] >>106224047
>>106223992
>Third party node shit
You get what you deserve!
Anonymous No.106224047 [Report] >>106224101 >>106224116
>>106224039
man i just want something that saves the neg and pos prompts into the image so i dont have to repeat it when passing on adetailer...
Anonymous No.106224088 [Report]
wtf with lightshit 2.2 ? : /
Anonymous No.106224101 [Report]
>>106224047
I love low IQ people
Anonymous No.106224116 [Report]
>>106224047
It saves the pos / neg prompts in the image by default, what do you mean ?
Anonymous No.106224133 [Report] >>106224152 >>106224153 >>106224351
the anime girl is smoking a cigar and has a fedora on. keep her expression the same.
Anonymous No.106224152 [Report] >>106224351
>>106224133
Anonymous No.106224153 [Report]
>>106224133
that's a feather glued to a .50 cal
Anonymous No.106224163 [Report]
>>106223288
NTA but i was playing with the lora a few hours ago
https://files.catbox.moe/vphxc3.mp4
Anonymous No.106224178 [Report] >>106224191
what do i do when adetailer cant fix hands?
Anonymous No.106224191 [Report] >>106224200 >>106224212
>>106224178
inpaint
Anonymous No.106224200 [Report] >>106224212 >>106224217
>>106224191
>tfw i use comfyui
lets go spend 1 hour setting up an inpaint workflow haha...
Anonymous No.106224209 [Report] >>106224351
one more

the anime girl is wearing a chef's hat and is holding a steel cooking pan.
Anonymous No.106224211 [Report]
>>106222617
Useful, thanks. What about continuous "she is sitting..."?
Anonymous No.106224212 [Report]
>>106224191
>>106224200
I think the issue is detection, it literally doesnt change the hands...
but then again havent genned for a while so i dont even remember what is the parameter that controls this
Anonymous No.106224217 [Report]
>>106224200
thats why you install all the tools since you will need them at some point anyway, krita ai plugin/reforge
Anonymous No.106224284 [Report]
>>106223992
https://litter.catbox.moe/epxyf1wpr78fbmkr.png
Anonymous No.106224302 [Report] >>106224325
>about 1/10 times the video combine corrupts the video and makes it unplayable
>all the frames the sampler did are gone
anyone else have this problem? i have two video combine nodes just in case
Anonymous No.106224314 [Report] >>106224347 >>106224365
when looking at prompts, when a prompt has something like (bends over 1.2) what does that signify?
Anonymous No.106224321 [Report]
is there any way to prompt so that it assumes it's already mid-action? like it keeps starting my prompts as static scenes and _then_ something happens, but it might just be a skill issue on my part
Anonymous No.106224325 [Report]
>>106224302
comfy will skip any steps that are identical to the last gen, just set the seed back and run it again
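
A toy model of that skipping behaviour, assuming nodes are cached by their inputs. `NodeCache` and `fake_sampler` are invented for illustration and are not ComfyUI's actual implementation:

```python
class NodeCache:
    """Re-run a node only when its inputs differ from a previous run."""

    def __init__(self):
        self._store = {}
        self.executions = 0

    def run(self, node_name, fn, **inputs):
        # Key on the node name plus its sorted keyword inputs
        key = (node_name, tuple(sorted(inputs.items())))
        if key not in self._store:
            self._store[key] = fn(**inputs)
            self.executions += 1
        return self._store[key]

def fake_sampler(seed, steps):
    return f"latent(seed={seed}, steps={steps})"

cache = NodeCache()
first = cache.run("sampler", fake_sampler, seed=42, steps=4)
again = cache.run("sampler", fake_sampler, seed=42, steps=4)    # cache hit
changed = cache.run("sampler", fake_sampler, seed=43, steps=4)  # re-executes
print(cache.executions)
```

With the seed set back to the earlier value, the sampler result comes from the cache and only the downstream save step has to run again.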
Anonymous No.106224346 [Report]
>>106224342
>>106224342
>>106224342
>>106224342
Anonymous No.106224347 [Report]
Has anyone tried the kohya sd-scripts update (sd3 branch) that implements chroma training? Has anyone gotten it working? I'm running into trouble with the training command.

>>106224314
Think of it as a multiplier for the term in the prompt. In this case "bends over" will be evaluated at 1.2x.
Anonymous No.106224351 [Report] >>106224370
>>106224133
>>106224152
>>106224209

I have just realized I can make a profile picture with AI
Anonymous No.106224365 [Report]
>>106224314
you mean (bends over:1.2)? that increases the prompt weight for whatever is inside the parentheses
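
A rough sketch of how the `(text:weight)` syntax can be split into weighted chunks. This is a simplified parser for illustration only; `parse_weights` is a made-up helper, and real UIs also handle nesting, bare parentheses and escapes:

```python
import re

# Matches "(some text:1.2)" with no nested parentheses
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")

def parse_weights(prompt):
    """Return a list of (text, weight); unweighted text gets weight 1.0."""
    parts = []
    pos = 0
    for m in WEIGHTED.finditer(prompt):
        plain = prompt[pos:m.start()].strip(" ,")
        if plain:
            parts.append((plain, 1.0))
        parts.append((m.group(1).strip(), float(m.group(2))))
        pos = m.end()
    tail = prompt[pos:].strip(" ,")
    if tail:
        parts.append((tail, 1.0))
    return parts

parsed = parse_weights("medieval village, (thatched roofs:1.2), fog")
print(parsed)
```

Everything outside the parentheses stays at 1.0, and only the wrapped term gets the 1.2 multiplier.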
Anonymous No.106224370 [Report]
>>106224351
galaxybrain-kun...
Anonymous No.106224865 [Report]
>>106222222
super wasted
Anonymous No.106224884 [Report]
Anonymous No.106225474 [Report]
Anonymous No.106225561 [Report]
>>106221281 (OP)
Noob here, I'm checking with novaPixelsXL.

I'm applying the recommended settings.

Prompt: pretty girl, pixel art, dithering, 16-bit style, illustration, ultra-detailed, vibrant colors, sprite, game art.

And it's not working...

Is there something I'm not understanding?