← Home ← Back to /g/

Thread 106285704

333 posts 198 images /g/
Anonymous No.106285704 >>106285718 >>106285775 >>106286013
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106282974

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.1: https://rentry.org/wan21kjguide
2.2: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-HD/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106285717
/adt/ it's dying and you're here in a new thread... Shame on you, /ldg/! You killed your little brother, the VRAMlets refugee, the third world gooner's space, you killed it!
STATLER No.106285718 >>106285723
>>106285704 (OP)
i told ya the ldg slop girl was collage bait beahagaha
Anonymous No.106285721 >>106285727 >>106285729 >>106285770
Qwen Image Edit model waiting room.
WALDORF No.106285723
>>106285718
there's one good thing;
It's not like this thread can get any worse
BEAHAGHaHAH!!!!
Anonymous No.106285727 >>106285747
>>106285721
the nose ring is a sign of ultra low intelligence.
Anonymous No.106285728
schizo crashout
Anonymous No.106285729
>>106285721
impressive, if only it was ok with nsfw
Anonymous No.106285742 >>106285755 >>106285767
Chroma training waiting room.
What levels are your Chroma, anons?
Anonymous No.106285746 >>106285768
Blessed thread of frenship
Anonymous No.106285747
>>106285727
>he needed a sign other than a being resembling a female
Anonymous No.106285755
>>106285742
Mine v250, now the feet look somewhat human.
Anonymous No.106285762 >>106285773 >>106285784
when can i use a1111 for wan instead comfyui
Anonymous No.106285767
>>106285742
v48GODS got vindicated
v50 is for anatomycopers
Anonymous No.106285768
>>106285746
/ldg/ is my literal toilet :3
Anonymous No.106285770 >>106285778
>>106285721
why doesn't it fix the fucked up hand?
Anonymous No.106285773
>>106285762
If you're Forgeanon, you're at the limit of any ldg anon before moving on to Comfy. And that is:
A111 is abandoned. The last support is poor and buggy compatibility with Flux.
Anonymous No.106285775
>>106285704 (OP)
Thanks Bread, nice memes everyone
I'll just leave this here
ehheehe
https://rentry.org/mwarchive
Anonymous No.106285778
>>106285770
It's a real photo
Anonymous No.106285784 >>106285798 >>106285807 >>106285808
>>106285762
https://github.com/deepbeepmeep/Wan2GP
here you go retard
Anonymous No.106285798 >>106285805 >>106285807 >>106285810
>>106285784
>cant send lora to system ram
worthless
Anonymous No.106285805
>>106285798
the program is made for vramlet and ramlet retards, so it doesnt expect the user has any spare resources
Anonymous No.106285807
>>106285798
>>106285784
like literally "for low vram poor" but you cant offload shit to my 64gb of system ram
pointless shitware
Anonymous No.106285808 >>106285812
>>106285784
>deepbeepmeep
Something about this guy susses me out.
Anonymous No.106285810 >>106285822
>>106285798
it does. that is pretty much standard
Anonymous No.106285812 >>106285815
>>106285808
you're just a schizophrenic retard refuses to use Gmail
Anonymous No.106285815 >>106285828
>>106285812
I have a gmail. wtf?
Anonymous No.106285822
>>106285810
nope. if i stack wan lora they "arent loaded" it only lets me use vram
there are NO settings to change
its babby wan "i can only click my ipad screen" software
Anonymous No.106285824 >>106287888
Anonymous No.106285828 >>106285832
>>106285815
then why didnt you GEN JAM!??
Anonymous No.106285832 >>106285839
>>106285828
You've touched on a very important point. Despite having a gmail, I don't share it willingly.
Anonymous No.106285835 >>106286031 >>106286078 >>106289188
Anonymous No.106285838
Anonymous No.106285839 >>106285874
>>106285832
so yes you are schizophrenic
Anonymous No.106285874 >>106285895
>>106285839
nta, post your gmail
Anonymous No.106285880 >>106285917
Anonymous No.106285891
so in console after "requesting to load wan21" it'll say loaded partially or loaded completely and some numbers. What does this mean and how to I interpret the numbers?
Anonymous No.106285893
How do models just KNOW asuka?
Anonymous No.106285895 >>106285903
>>106285874
anistudio14@gmail
Anonymous No.106285903
>>106285895
and your phone number associated with the account like most people have since it was mostly mandatory?
Anonymous No.106285917 >>106285992
>>106285880
my beautiful wife
Anonymous No.106285957 >>106285973 >>106286905
Anonymous No.106285965 >>106285979
In forge, when I want to use a model. let's say sd3.5_medium; then what do I need to do?
Do I just need to put the safetensors file in the model folder or are there other files or things I need to do to 'install' it correctly?
When I look at huggingface there are a lot of other files and folders.
Anonymous No.106285973
>>106285957
Unironically better animated than the show.
Anonymous No.106285979 >>106285986
>>106285965
I can't believe you're still trying to use SD3.5 after everyone told you its dead on arrival garbage.
Anonymous No.106285984 >>106286008 >>106286205
the camera zooms out far on a man that picks up an ak-47 and fires it directly at the camera several times with a flash from the barrel with each shot, at a press conference.
Anonymous No.106285986 >>106285997
>>106285979
let's say sd3.0 then.
Anonymous No.106285990
what is lightning that everyone talks about
Anonymous No.106285992
>>106285917
She's a little clumsy.
Anonymous No.106285997 >>106286007
>>106285986
That's literally worse. Why do you want to use these? I do not get it.
Anonymous No.106286007 >>106286017 >>106286020 >>106286026
>>106285997
I've had good experiences with sd3 on some simple free online generators.
Tell me why they're bad and what to use instead.
Anonymous No.106286008 >>106286205
>>106285984
Anonymous No.106286012 >>106286110
Anonymous No.106286013 >>106286076
>>106285704 (OP)
Well finally got wan2gp running with 2.2 5b, took about a minute for three shitty seconds of low quality video, and I decided making videos just isn’t for me yet. Reject chasing the tech dragon, come back to base noobai and it’s shitty output Kek
>picrel
Anonymous No.106286017 >>106286049
>>106286007
not him but for anime, wai v14 (illustrious/noob) is far better and does far better anatomy.
Anonymous No.106286020 >>106286049
>>106286007
flux for sfw stuff
chroma if you want to coom
Anonymous No.106286021
Anonymous No.106286026 >>106286049
>>106286007
They have garbage anatomy, are censored, almost completely unsupported by LoRAs and fine tunes and there are must better models like, any SDXL finetune, flux, chroma, qwen etc.
Anonymous No.106286031
>>106285835
Best post these threads have seen in a long time. Course it’ll fly under the radar
Anonymous No.106286044
does wan train on all the frames of high framerate videos
is that why the output often looks slowmo
Anonymous No.106286047
ive tried and tried and tried but WAN CANNOT do the sph pinching gesture natively with prompting
Anonymous No.106286048 >>106286099
>erm guys how do I use sd1.5 in A1111
How are you people so easy to bait?
Anonymous No.106286049 >>106286069 >>106286070
>>106286017
>>106286020
>>106286026
Right now I want to do some landscape/cityscape type stuff
Anonymous No.106286069
>>106286049
I guess flux then.
Anonymous No.106286070
>>106286049
flux or qwen are good for realism, illustrious/noob for anime, but there are even realistic illustrious checkpoints too that can work.
Anonymous No.106286076 >>106286113
>>106286013

5b is completely useless. If you're vram limited you can use quants of the full model at a tiny resolution.
Anonymous No.106286078
>>106285835
lol. holy shit. so good.
Anonymous No.106286079 >>106286131
Anonymous No.106286085
first time seeing anon i2v a /g/ maymay?
Anonymous No.106286099
>>106286048
That wasn't the question. The question was, is the checkpoint file the only thing you need, or do you need any of the other stuff that come in the git repo
Anonymous No.106286110
>>106286012
2.1 is still great. also would
Anonymous No.106286113 >>106286308 >>106286323
>>106286076
Maybe I’ll finally bite the bullet at some point and see about setting it up in comfyui. Wan2gp is jank as fuck. I prefer the webui style but that thing is a piece of shit tbqhwy
Anonymous No.106286118 >>106286127 >>106286141
what are the most useful loras / addons for wan2.2
people mention lightning or light. idk what that is
just tell me ok
Anonymous No.106286127 >>106286214
>>106286118
Light but only the 2.1 version. The 2.2 version kills motion and blows out the colors. Everything else is just porn.
Anonymous No.106286131 >>106286152
>>106286079
your 2.1 gens have great quality. no loras used?
Anonymous No.106286134
Anonymous No.106286141 >>106286214
>>106286118
lightx2v was the old name for 2.1, lightning is 2.2

These loras allow low step counts / short gen times to make decent videos. There's a quality and prompt adherence penalty but there's workflows to mitigate that
Anonymous No.106286149 >>106286926
Note, FP16 wan 2.2 model is worlds better than FP8 pre-quanted even when you use the FP16 one with kanji's fp8 quanting.

Hopefully numchucku comes out soon as that is closer to FP16 than 8 bit is while being as smaller as 4bit
Anonymous No.106286152
>>106286131
Oh the filename is wrong. It's 2.2
Anonymous No.106286163
god I hate coping with quants at 24gb
Anonymous No.106286178
Anonymous No.106286184 >>106286204
Same exact settings but here is FP8 vs FP16
FP8:
https://files.catbox.moe/ac1xvy.mp4
FP16:
https://files.catbox.moe/qsxjhb.mp4
Anonymous No.106286186 >>106286221
>get caption with joycaption beta
>put into chroma
>looks like real image
>change prompt slightly like enormous breasts or whatever
>now it is incoherent deformed slop
is there a way around this problem
Anonymous No.106286202
unlimited length video gens may be a thing eventually
https://derewah.dev/projects/self-forcing-endless
https://github.com/Dere-Wah/Self-Forcing-Endless
Anonymous No.106286204 >>106286219
>>106286184
Now do Q8. FP8 is fucked.
Anonymous No.106286205
>>106285984
>>106286008
whats with the har dick for putin
gen him shiddin in a diaper
Anonymous No.106286206
Anonymous No.106286214
>>106286127
>>106286141
thanks

is there any benefit to using it if I use the full number of steps?
Anonymous No.106286219
>>106286204
https://files.catbox.moe/5zr5c8.mp4
Anonymous No.106286221 >>106286230
>>106286186
Wait for the retrain to see if that fixes it or train a lora using diffusion-pipe or ai-toolkit
Anonymous No.106286225
The bulge was supposed to be in her crotch but okay.
Anonymous No.106286227
Anonymous No.106286230 >>106286238
>>106286221
Sa you could also try using v48 if you're not already using that
Anonymous No.106286236
why the fuck couldn't 2.2 just be simple to use with fucking loras. this fucking black magic 5 steps shit is so obtuse
Anonymous No.106286238 >>106286255
>>106286230
should I get v48 detail calibrated
Anonymous No.106286242
Anonymous No.106286251 >>106286263 >>106286266
reminder the 7900 XTX is a sleeper AI GPU and excellent value and VRAM for the price, as long as you use Linux
Anonymous No.106286255
>>106286238
Some anons say that base v48 is better but the people involved with making Chroma recommend detail calibrated. If you can only download one then get detail calibrated.
Anonymous No.106286263 >>106286293
>>106286251
>not having all the latest optimizations including nunchucu which makes qwen image take less than 10 seconds and will make wan take like 30 seconds
Anonymous No.106286266 >>106286273 >>106286278 >>106286293
>>106286251
Wouldn't this be a dead end for training anything
Anonymous No.106286273 >>106286278 >>106286293
>>106286266
and a dead end for actually decent speed, a 4060 would be much faster
Anonymous No.106286278 >>106286293
>>106286266
>>106286273
and if you needed 24GB then a used 3090 would be cheaper and like 4x faster
Anonymous No.106286293 >>106286302
>>106286263
nunchaku is a bitch rn but I have faith we'll get an AMD SVDQ implementation some day.

>>106286266
no idea, I just gen. have any AMDfriends ITT tried training lately?

>>106286273
>>106286278
the 7900 XTX is ~3090 speed currently.
Anonymous No.106286296 >>106286303 >>106286349 >>106286352 >>106286371
>>106286253
>>106286174
>>106286089
>>106286030
Please, tell your schizo to fuck off.
Anonymous No.106286302
>>106286293
>the 7900 XTX is ~3090 speed currently.
not if you count all the optimizations AMD cant use, not even talking about nunchaku which is cuda
Anonymous No.106286303
>>106286296
Report them. We can't even get our schizos to fuck off.
Anonymous No.106286308
>>106286113
Kek
Anonymous No.106286313 >>106286320 >>106286336 >>106286342 >>106286368
why do companies insist on fucking up this tech in the cradle with shitty fucking pricing and design decisions. every fucking area in the field is filled with this grifter ass mentality and it just ruins fucking everything. even fucking comfy is in on it and he gives out the fucking code for free. makes no goddamn sense why people eat this shit up when it could be so much better
Anonymous No.106286317
Anonymous No.106286320
>>106286313
gamers don't need more than 12GB and the "enthusiast" has the RTX PRO 6000
Anonymous No.106286323
>>106286113
Yeah it's not great. Although I have to wonder why they even made the 5b model at all. It needing a minimum of 720p makes no sense. For what it's worth I gen ideas at 512 to keep it faster.
Anonymous No.106286326
https://danbooru.donmai.us/posts?tags=rape_face&z=5
Anonymous No.106286334
Anonymous No.106286336
>>106286313
Because AI tech attracted the same retards that turned crypto from something potentially useful and transformative into a get-rich-quick scheme. Everyone involved is there so that they can earn a quick buck before the bubble pops.
Anonymous No.106286342 >>106286347
>>106286313
>in the cradle
A baby this old should be walking on his own two legs already. How many billions of VC money does it need before that?
Anonymous No.106286347
>>106286342
just one more $1 trillion nuclear powered datacenter anon please anon just one more and then it'll finally live up to the hype you have to believe me
Anonymous No.106286349
>>106286296
false flagging on 4chan is the final stage of autism
Anonymous No.106286352
>>106286296
If you want to have your own sovereign thread, you must know how to tame your own shizos.
Anonymous No.106286368 >>106286407
>>106286313
>@grok why does profit seeking ruin everything?
your misery is the natural state of capitalism. you've believed the lie all your life that capitalism creates competition which creates optimal products through market pressures. the reality is that capitalism creates profit seekers which creates enshittified products through profit maximization
>but no one will want the product if its shit
ah, no one will want the product long term if its shit, but you're stuck with it short term. and if all the other market players also race to the bottom, then there will never be a good product to supercede the bad products

tldr, any time a question starts with "why do companies", the answer is always: capitalism working as intended
Anonymous No.106286369 >>106286426
Anonymous No.106286371
>>106286296
I would schizopost there myself if it wasn't full of loli
Anonymous No.106286381
Chroma paper when
Anonymous No.106286407 >>106286415
>>106286368
Nice colors, how did you gen it?
Anonymous No.106286415 >>106286723
>>106286407
chroma v50
Anonymous No.106286425
Back, fag
Anonymous No.106286426
>>106286369
scary
Anonymous No.106286428 >>106286439 >>106286445 >>106286448 >>106286452 >>106286462 >>106286683
Probably a noob question but how the heck do you properly β€œadjust” two separate characters? I keep trying to get like a man and a woman like that classic β€œsave me strong man” type image from movies and such, and no matter how I adjust it just keeps giving my man the woman’s traits I enter lmao. Great I guess if I wanted corny beefcake coom, but I actually want a man and a woman lol. Picrel a seed finally put woman in there instead of a black guy lmao. It’s just the base noobaixl model on forge classic
Anonymous No.106286439
>>106286428
DAMN HE THICC!!!!!!!!!!
Anonymous No.106286445 >>106286469 >>106286681
>>106286428
Regional prompting or inpainting is the only surefire way to do this with XL. Just prompting it has a chance of fucking it up.
Anonymous No.106286448 >>106286469
>>106286428
SDXL based model's clip causes prompt bleed. It's kind of unavoidable.
Anonymous No.106286452 >>106286469
>>106286428
model?
Anonymous No.106286462 >>106286681
>>106286428
what you do is this: take that image and inpaint the characters with the appropriate prompt. for the guy remove the girl prompt parts and vice versa for the girl.
Anonymous No.106286465 >>106286468 >>106286473
How do we ensure that AI systems are aligned with human values and intentions?
Anonymous No.106286468
>>106286465
wrong thread sam
Anonymous No.106286469 >>106286609
>>106286452
Noob ai xl vpred. From the meta rentry >>106286448
>>106286445
hmm, I guess I could give in painting a shot I hadn’t even thought of it.
Anonymous No.106286473
>>106286465
By their very nature of being man made.
Anonymous No.106286500 >>106286510 >>106286521 >>106286976
Just going to say wan 2.2 lora's work fine in 2.1 at least the low noise ones do, they work magically. Wan 2.1 is still fucking amazing you just have to learn out to prompt it better.

i said already how i prompt it and it mostly always works with my style, a series of statements rather than a story or a schizo prompt...

the woman is Asian.

the woman has long black hair.

the woman is nude.

etc, trust me on this it will really improve shit for you, there is no better way to prompt it.

always be very deliberate in your prompts, read it like a robot, that is what this model understands.
Anonymous No.106286510 >>106286537 >>106286976
>>106286500
>the woman is Asian.
>the woman has long black hair.
>the woman is nude.
Amazing these are my prompts ad verbatim.
Anonymous No.106286521 >>106286549
>>106286500
Yeah but why would I use 2.1 when 2.2 exists?
Anonymous No.106286535 >>106286538
>qwen image
verdict?
Anonymous No.106286537 >>106286976
>>106286510
it works, you just do that in smallest blunt statemetns and it works so much better, can't believe i didn't do it earlier.

The woman faces away.

the man lies behind the woman with his erect penis.

the woman moves up and down sliding the mans penis into her buttocks repeated motion.


its so fucking easy bro, i can do anything with thing now that i understand it. lora's in general need low strength like 0.3, character lora need like 1.00
Anonymous No.106286538 >>106286569
>>106286535
meh
Anonymous No.106286549 >>106286562
>>106286521
>2.2 exists?
its not worth it, its fucking slow and a pain in the ass, its really does not give much over 2.1 in terms of quality.

its good but really its not needed for simple gens.
Anonymous No.106286562 >>106286583 >>106286658
>>106286549
>its really does not give much over 2.1 in terms of quality.
Yes it does. Maybe if all you're genning is a static shot of one naked women it's not a big leap.
Anonymous No.106286563 >>106286578 >>106286594
You could use WAN to animate stills and then take frames from that and use them for a comic, seems bit backwards, but it's a thought
Anonymous No.106286569 >>106286604 >>106286857
>>106286538
Choose your bootleg warcraft slopfu.
Anonymous No.106286578 >>106286593
>>106286563
I would be more excited for the prospect to animate never adapted manga and comics into videos
Surely someday a weeb will come up with a pipeline for that, right? I am surprised no one has yet
Anonymous No.106286583
>>106286562
for gooning when i want its really juts better to load 1 gguf for wan 2.1 anon... really with my prompts i get more or less the same result as wan 2.2...

i've stopped using pony etc simply because i can gen far far easier with wan with absolutely predictability and reproducibility over almost all seeds.
Anonymous No.106286593 >>106286775
>>106286578
already millions of nodes if it were to be done in c*mfy. we really need something else to work with
Anonymous No.106286594
>>106286563
You can also use it to make more images in a dataset for a LoRA.
Anonymous No.106286604
>>106286569
thats a very ugly orc
Anonymous No.106286609 >>106286666 >>106286681
>>106286469
Welp, not sure i know how to do this inpainting business. its just making a tranny now lol.
Anonymous No.106286658
>>106286562
i'm sleepy but in the morning or when i wake up i will post some complex long prompts for difficult porn positions. one of my favorites is prone bone but videoed from behind both subjects, so you seed the mans dick fucking really owning her ass. that's actually hard to do with just sdxl based models in most cases. and its hard with wan unless you know how to tell wan exactly what is in the scene.

i need to sleep right now or i will die.
Anonymous No.106286666 >>106286685 >>106286747
>>106286609
can you guys find another channel? it's a blue board.
Anonymous No.106286671
>The wolf gallops
>Gives it hooves.
Anonymous No.106286676 >>106286711
the blue hair anime girl on the left takes out an ak-47 and points it at the red hair anime girl.
Anonymous No.106286681 >>106286801
>>106286609
ok i guess you really have to just spam until a seed lines up and make sure you prompt to make her fit in the painted area, it sorta looks ok after a few img2img loopbacks. next thing to tackle is that fried out look.
>>106286445
>>106286462
thanks for the tip lads honestly never tried to paint in a character like this before.
Anonymous No.106286683 >>106286801
>>106286428
Without any irony, an anon dropped a how2 guide on Regional Prompting and Forge Couple in the latest /adt/ thread >>106265545. It's one of the better explanations I've seen breaking down both tools in a way that actually makes sense. Worth checking out especially if you're trying to get multiple characters or regions working properly in your gens. The explanations are straight to the point.
Anonymous No.106286685
>>106286666
blue is part of the red white and blue, faggot.
Anonymous No.106286711
>>106286676
this time, they are frens
Anonymous No.106286723 >>106286728
>>106286415
Oh nice! Can share it in the anime thread? Do you have more exaples? This are copyrighted characters or OCs?
Anonymous No.106286728 >>106286747
>>106286723
>Trying to poach users for the anime thread

Why does it even exist in the first place? Why can't you just post anime here?
Anonymous No.106286747 >>106286767
>>106286728
retards like >>106286666 reee about "their" threads getting posts they don't like. these ai threads are turbo sperg magnets.
Anonymous No.106286767
>>106286747
fair enough desu
Anonymous No.106286775 >>106286929
>>106286593
Probably how it would work:
1 - A node to iterate over a manga's / comic's pages
2 - A node to identify the frames within that page
3 - A node/flow to somehow identify known characters from those frames and their positions
4 - A node/flow to orchestrate things with LLM calls: to build payloads for TTS with designated voices for those characters, how each frame should be colorized, and identify if any frames belongs to the same scene, or are from different scenes; return a scene description object with their frames
5 - An image editing node/flow to remove speech bubbles and colorize the frame (in case it's a manga), using instructions from the LLM
6 - A TTS synthesis node/flow based on the payloads by the LLM
7 - Video gen based on conditions: determine if the scene object is just one frame (in that case, it would do only I2V), or multiple frames (would probably have to use a FLF2V model or VACE) . Would have to use something like multitalk to make characters move their mouths (speak) in the scene if any TTS audio track is available, the tricky part is making certain voices associated with certain characters when there are multiple characters in the same scene.
8 - An SFX synthesis node/flow that produces sound effect based on the video output (there are models that do that, I forgot the name)
10 - Merge the video and audio tracks together
11 - Concatenate all videos sequentially.

No soundtrack though since local is garbage at music gen and will probably be for a long time.
Anonymous No.106286801 >>106286829
>>106286681
gave it a finishing pass or two in a model called "oneobsession" that's more stylized anime tuned than the og Noob AI XL, and gotta say, i'm pretty satisfied. thanks anons, genning infinite one girls was starting to get a bit less fun but now i can control things more.
>>106286683
hmm, thanks, may have to take a look at it.
Anonymous No.106286809
Anonymous No.106286812
Anonymous No.106286829
>>106286801
for more than 200 years, it was considered offensive to wear the american flag as a garment. only since the 80s (reagan) and onward, has this point of american culture fallen on deaf ears. the "patriots" no long revere the philosophy of america, only its symbols. just how modern christians would call jesus a socialist yet they still wear the cross. what a twisted modernity
Anonymous No.106286857 >>106286860
>>106286569
>green elf
Anonymous No.106286860
>>106286857
>Chinese orc
Anonymous No.106286873 >>106287015
Anonymous No.106286905
>>106285957
how did you make it so she wasn't talking and talking?
Anonymous No.106286925 >>106286944
the man turns around and jumps off a bridge into the water.

you ok johnny?
Anonymous No.106286926 >>106286935
>>106286149
I didn't see any difference at least between fp16 and fp8_scaled, so what did you see in your tests?
Anonymous No.106286929
>>106286775
this is your brain on NODES kids. remember, not even once
Anonymous No.106286935 >>106286995
>>106286926
>I didn't see any difference at least between fp16 and fp8_scaled
Are you blind. fp8 scaled butchers the outputs.
Anonymous No.106286944
>>106286925
better
Anonymous No.106286954
Anonymous No.106286955
Note to self: DO NOT pick 60 images to i2v in a row. 10-20 is a more reasonable size.
Anonymous No.106286956
Is there any point in cfg >1 in the low noise/refiner stage in wan?
Anonymous No.106286973 >>106287005
>3k€ for a proper 5090 in my country
>2200€ for a literal who version that will probably melt the cable on first video gen
I hate being poor
Anonymous No.106286974
the man flies into the sky like a rocket, with rocket trails from his feet.

neat
Anonymous No.106286976
>>106286500
>>106286510
>>106286537
I guess this is for t2v? For i2v I had this purple prose writing through LLM where I made paragraphs of motion descriptions, but in the end what worked the best where just a few sentences of what you want to go on, and only add details whenever a gen looks weird on one part.
Also using simplified chinese instead of english is apparently better, though I can't confirm that.
Anonymous No.106286995 >>106287001
>>106286935
Nope, fp8 maybe, but fp8_scaled is the same as q8 from my tests.
Anonymous No.106286999
the man flies into the sky like a rocket, with rocket trails from his feet, while spinning and firing two silver pistols.
Anonymous No.106287001 >>106287007
>>106286995
Wrong. It's shit.
Anonymous No.106287005 >>106287026 >>106287028
>>106286973
>>3k€ for a proper 5090 in my country
which one, the fe? what country is that?

>2200€ for a literal who version
they're all the same anon, the only differences are cosmetic and how loud they are (except the aio versions and the fe that is thinner)
Anonymous No.106287007
>>106287001
Whatever floats your boat.
Anonymous No.106287015 >>106287025
>>106286873
damn. what's the prompt?
Anonymous No.106287025
>>106287015
Anime style, 3D animated footage, A girl wearing samurai themed leather bikini armor with hair styled in ponytail is walking through a dark dungeon. The only light is the torchlight from the torch she is holding. She approaches a door on the wall beside her and opens in. From the other side of the door a skeleton can be seen chained to the wall. It is wearing a tattered shirt that reads "Chroma". 4K cinematic, HD,
Anonymous No.106287026
>>106287005
>the only differences are cosmetic and how loud they are
Agreed, nvidia has a lot of control over what they allow AIB to do, which has a pro of a "minimal quality", but a con of "all the cards are basically the same".
Anonymous No.106287028 >>106287047
>>106287005
astral is built different idk about the others but at the very least astral won't catastrophically melt since it will cut off power if one pin fails

and my bad it's 2299€ for the cheapest one
>inno3d rtx 5090 X3
my 970 was from them and it broke after 2 years never again
Anonymous No.106287042
Anonymous No.106287047 >>106287088 >>106287090
>>106287028
The astral is 50% more expensive than the fe, at this price I'd gamble anything else, especially as their "pin protection" doesn't actually do any stopping, it only warns you and only if you use their program for that.
You are better off just power limiting the cards at 400-450W especially for genning.
And it's not like all of them burn, I have a 5090/4090 and 2x3090, all of them are perfectly fine, all fe too.
Anonymous No.106287051
what's hardware requirement for training wan 2.2 loras? and how many videos are needed?
Anonymous No.106287056 >>106287087
Anonymous No.106287077 >>106287082
Training my own pix2pix model, chuds.
Anonymous No.106287082
>>106287077
make sure comfy gets the implementation before any other ui
Anonymous No.106287085 >>106287162
I'm honestly surprised that it was able to animate this
Anonymous No.106287087
>>106287056
Speaking of it, real question for Wan users: is it possible to add smoke to fart videos and make the butt slightly shake while doing so?
Anonymous No.106287088 >>106287139
>>106287047
reason I got the astral is waterblocking the FE is a massive pain in the ass. and everyone I've talked to with an FE says that shit is loud as fuck
Anonymous No.106287090
>>106287047
>especially as their "pin protection" doesn't actually do any stopping, it only warns you and only if you use their program for that.
if a pin fails it shuts down instead of letting the whole damn thing melt
of course at that point it needs repair but it's still better than nothing

>And it's not like all of them burn
of course not but I really can't afford it happening to me at that price
Anonymous No.106287139 >>106287199
>>106287088
I wanted to get an AIO but I felt like the resell value would be shit in a few years whenever the internal liquid needs refill or the whole card would need painful cleaning or the pump would inevitably fail.

>of course not but I really can't afford it happening to me at that price
Sure. If that can reassure you, my fe is basically running 14-18h per day (genning) at 460W and it's hot as fuck outside, with no cable problem at all for 2 months straight.
Anonymous No.106287147
>no chroma lora tag on civitai
it's over
Anonymous No.106287162
>>106287085
Pretty convincing physics too.
Anonymous No.106287199 >>106287227
>>106287139
if you are thinking in terms of "value" just do cloud. Even if you are actually using the shit that often (you aren't) it is still better "value". buying a 5090 is hobbyist tier. fucking ada6000 is cheaper than 5090 on cloud, and 48gb unlocks so much.
Anonymous No.106287227
>>106287199
Well I don't pay my electricity due to my job so I have other incentives than a normal person, and also in general as a hobbyist I like to own my stuff and not rely on "cloud" if I can help it.
Anonymous No.106287240
anyone uses this?
https://github.com/FlyMyAI/flymyai-lora-trainer
Anonymous No.106287268
Anonymous No.106287291 >>106287296 >>106287334
How dificult is to make my own models that solve a very specific single task instead of a big model that does everything?
Anonymous No.106287294
Anonymous No.106287296
>>106287291
is that task titties perchance?
Anonymous No.106287316
Is there any ETA for nunchaku for wan?
How much time do they usually take to adapt the models? Months?
Anonymous No.106287328
Anonymous No.106287334
>>106287291
this is a very dumb question. No insult, it is just very stupid.
Anonymous No.106287344
chroma retrained as mixture of experts (like wan) would be so good... so the last expert makes it slop free every time
Anonymous No.106287382 >>106287390 >>106287415
>wan gen (without lightvx) on a 5090: 22 minutes
>wan gen (without lightvx) on a 3090: 1 hour 40 minutes
lol.
Anonymous No.106287388 >>106287399 >>106287420
wan 2.2 gen of miku flying, from a qwen prompt of miku flying ghibili style
Anonymous No.106287390 >>106287411
>>106287382
40 steps?
Anonymous No.106287399
>>106287388
> SHUT UP SHUT UP SHUT UP SHUT UP
Anonymous No.106287411
>>106287390
yeah 40steps in 720x1280, the difference is kind of insane, and it doesn't even look like it's the vram, just architecture optimizations and raw compute
Anonymous No.106287415 >>106287454
>>106287382
You're doing something wrong
Anonymous No.106287420
>>106287388
its neat how it knows how the wind would work on the twintails, despite no actual physics engine being present.
Anonymous No.106287454
>>106287415
it's the same workflow in both cases, only difference is how I load the models and some unavoidable block swapping for the 3090, see picrel
Anonymous No.106287465 >>106287489
Idle
>Attack 1
Attack 2
Run
Guard
Evade
Taking Damage
At Low HP
Incapacitated
Triumph
Flourish
Anonymous No.106287477 >>106287553
the anime girl drinks the full glass of whiskey and places it on the table.
Anonymous No.106287489
>>106287465
It appears you've pretty much perfected this now.
Anonymous No.106287518 >>106287526 >>106287553
cool, she actually got the bottle.

the anime girl grabs the whiskey bottle on the right, and pours it into a glass.
Anonymous No.106287521
Anonymous No.106287526 >>106287537
>>106287518
if it didn't recognise it as a whiskey bottle then she'd probably grab one from out of frame, I've had that happen
Anonymous No.106287537
>>106287526
yeah if you specify something like "gets into a car" that's a good way of getting a motion gen, cause they will walk left or right to one (usually)
Anonymous No.106287553 >>106287563
>>106287477
>>106287518
soco is so fucking gross
Anonymous No.106287563
>>106287553
not my personal preference, just a random test image
Anonymous No.106287567
Qwen is really good
Anonymous No.106287610 >>106287841
Anonymous No.106287627 >>106287640
Qwen will be forgotten when the new chroma epoch drops
Anonymous No.106287635 >>106287645 >>106287701 >>106287745 >>106287760
we all will be forgotten when we die
Anonymous No.106287640
>>106287627
I don't think many people are using qwen anymore. wan is much more fun
Anonymous No.106287645
>>106287635
Nah just gotta be like Hitler and nobody will forget you.
Anonymous No.106287652
is there any sort of like, local ai audio gen? not just like text to speech or whatever, but like, genning audio for like a video, like sound effects n such?
Anonymous No.106287653
I think I broke my graphics card
Anonymous No.106287701
>>106287635
whats the point in being remembered? nothing is infinite, not even the longest memory. you're just as forgotten as anything else
Anonymous No.106287712
just remember to subscribe and comment
Anonymous No.106287745
>>106287635
can't wait
Anonymous No.106287749
elf-hugger No.106287760
>>106287635
Well, that's a relief.
Anonymous No.106287780 >>106287859
the pink hair anime girl wearing a white dress and white wings stands up and dives into the water, and a large tidal wave rises from the water.

neat water fx
Anonymous No.106287814
tips on making captions in lora dataset better?
Anonymous No.106287837 >>106287860 >>106287874 >>106287878
is there a guide to gpu picking
can you just buy 2 x cheap cards
Anonymous No.106287841 >>106287862 >>106287866
>>106287610
Qwen had the potential, I truly wanted to believe in it. But its supposed superior prompt adherence/art loses all potential when you can't properly prompt for artists. That means the model is highly reliant on LoRAs, same as Flux. And there's nothing wrong with Flux art/anime LoRAs aesthetically, it's SOTA on that department. Qwen was overbaked so it lacks variety. Its biggest flaw is that it's bigger than Flux dev while being censored. That means that it will likely never get a finetune. It doesn't matter that the community has seemingly adopted the model and mistaken it for a Flux replacement. The same community has neglected Chroma which simply is a much better replacement from a technical standpoint. Why? Because they tried it once, got a shit result and gave up on it. I also almost gave up on it quickly, but then I gave Chroma solid chance after being impressed by a 1girl result that I would never get out of a model like Flux. Since then, it has blown me away and handles everything I throw at it, I have not had a need to go back. The gap between this model and censored is pretty big, there are many different types of prompts, even softcore that only Chroma can handle. Got a style that it's missing? No problem, just train a LoRA. I feel like everyone at some point will have their Chroma discovery moment, but they have to give the model a chance.
Anonymous No.106287859
>>106287780
1 more
Anonymous No.106287860 >>106287913
>>106287837
Rule 1. More vram = better
Rule 2. Don't buy AMD.

There's your guide.
Anonymous No.106287862 >>106288029
>>106287841
there's a qwen lora tag on civitai but not chroma
that says about it all
Anonymous No.106287866
>>106287841
QwenGOD
Anonymous No.106287873
Anonymous No.106287874 >>106287913 >>106287945
>>106287837
RTX 5090 32 GB
it'll come in handy when models exceed 100 GB size
Anonymous No.106287878 >>106287913
>>106287837
3090
4090
5090

pick any
Anonymous No.106287879 >>106287908
Anonymous No.106287888
>>106285824
>That sexual arousal from watching sword fights
Based
Anonymous No.106287908
>>106287879

Is this how they greet each other in lewd feudal japan? Lol
Anonymous No.106287913 >>106287923 >>106288683
>>106287860
mm yes this i had understood already. will i have fun with 12 gb or will it feel like a waste after two days?
>>106287874
>>106287878
thanks lads. bit pricy innit
some 3090s selling for $1k here but they have all been used horizontally
Anonymous No.106287921
Startup idea: ai toilet cameras to optimize bidet spray patterns
Anonymous No.106287923
>>106287913
at the minimum go for 16GB vram
3090s at 1k$ used is kind of insane, but I don't know the situation in the US right now
Anonymous No.106287944 >>106287994
>will i have fun with 12 gb
No. You will be eternally frustrated.
Anonymous No.106287945 >>106287962
>>106287874
enjoy your housefire
Anonymous No.106287962 >>106287972 >>106288226 >>106288707
>>106287945
I assume most people would be undervolting these, no?
Anonymous No.106287972
>>106287962
https://www.tomshardware.com/pc-components/gpus/zotac-rtx-5090-reportedly-catches-fire-during-battlefield-6-session
undervolting can't save it either
Anonymous No.106287982 >>106288003 >>106288011 >>106288050 >>106288239
So the people who call for the death of people using AI in any form. Are they like a vocal minority or do most people feel this way?
Anonymous No.106287994 >>106287996
>>106287944
okay. i see a 5070 ti for 300 bucks more than what i had in mind. 4 gb more then
Anonymous No.106287996 >>106288045
>>106287994
Go for at least 24 or you'll regret it.
Anonymous No.106288000
Anonymous No.106288003 >>106288024 >>106288048
>>106287982
People also call for the death of jews, and more people hate jews than ai.
Anonymous No.106288011
>>106287982
just don't be Indian
Anonymous No.106288020 >>106288030
DO NOT animate this.
Anonymous No.106288024
>>106288003
For good reason
Anonymous No.106288029 >>106288039 >>106288052
>>106287862
You know anon, there's a way to prompt Chroma and get Qwen tier slop. I know because I have been testing this model for so long, pic rel was from a prompt I ran on v29. The model that wins is the model that gives you most control over style, and that goes overwhelmingly to Chroma.
Anonymous No.106288030
>>106288020
Eva Braun was such a waifu
Anonymous No.106288039
>>106288029
whatever you say, anon
the fact is there's no chroma lora tag on civitai
Anonymous No.106288042
Has anyone encountered the phenomenon of too many steps of the high noise pass just deep frying the output?
Anonymous No.106288045 >>106288056
>>106287996
back to the used 3090 for a grand maybe then. what to expect for longevity? already like five years old
Anonymous No.106288048 >>106288054
>>106288003
Jews made ai
Anonymous No.106288050
>>106287982
I remember when the "We have to kill ai artist" spammer on reddit got banned cause ai flagged it as abusive hate material.
Anonymous No.106288052 >>106288109
>>106288029
The problem with chroma is that while it will give you what you ask for. There's always dozens of visual artifacts and nonsense objects in the image that make them basically useless.
Anonymous No.106288054 >>106288057
>>106288048
no, they fund it
Anonymous No.106288056
>>106288045
Undervolt it and make sure it's kept well ventilated and I doubt it will die any time soon.
Anonymous No.106288057 >>106288061
>>106288054
Angloids are basically jews
Anonymous No.106288061
>>106288057
WHAT AN INSULT
terrible
Anonymous No.106288109 >>106288122
>>106288052
>dozens of visual artifacts and nonsense objects in the image

That is not true. Nonsense objects can be fixed with a better prompt. Visual artifacts can also be improved and if a concern entirely removed by asking for a certain type of slopped image. The alternative is having zero control over the image and getting a result that does not align with the prompt.
Anonymous No.106288122 >>106288148 >>106288232
>>106288109
All of the gens in I've seen in this general prove you wrong.
Why do chroma fags lie why people point out the systemic flaws in the model?
Anonymous No.106288131
Anonymous No.106288140
Anonymous No.106288148 >>106288165 >>106288192
>>106288122
Because you're lying about the capabilities of whatever model you're trying to shill. Qwen (pic rel) has worse visual artifacts and nonsense objects if I try a LoRA with the same photoreal style. Wan can only do cinematic photoreal style and it would take forever to get an image as sharp as Chroma anyways. Visual artifacts that are typically found on a real camera or photograph aren't a flaw, just a model that captures that kind of detail.
Anonymous No.106288152
>flux kontext
is it dead
Anonymous No.106288165 >>106288221
>>106288148
Go fuck yourself and post a catbox for that image I don't believe that is qwen.
Anonymous No.106288192 >>106288221
>>106288148
>I know I'll change the resolution of the chroma image to the default qwen ones and pretend it's a qwen generation
Anonymous No.106288221 >>106288228 >>106288260
>>106288165
>>106288192
Of course it's Qwen. Pic rel is also Qwen. These are both extremely easy for Chroma to get in its sleep. This is just one of many examples where you're going to see Qwen fall apart due to its censored nature.
https://files.catbox.moe/j75atj.png
Anonymous No.106288223
Anonymous No.106288226 >>106288707
>>106287962
I power limit mine to 450W.
People seem to think every card is burning or something.
Anonymous No.106288228 >>106288391
>>106288221
Either catbox is down or your picture isn't working.
Anonymous No.106288232 >>106288272
>>106288122
catbox an example? and post GPU, text encoder, and which version and quant of chroma.
these conversations usually end as soon as someone has to reveal that they're using every quality-destroying hack to squeeze it onto an 8GB AMD card or something.
Anonymous No.106288239 >>106288253
>>106287982
The ones actually crazy enough to send death threats to people posting ai images or videos? They're a deranged minority.
The general distaste for ai stuff? That's more common, it's the current "correct" position to have so most zoomers think that.
Anonymous No.106288244 >>106288270 >>106288275
Anonymous No.106288253
>>106288239

Image Gen is also mostly worthless and 90% coom material so who cares.
Anonymous No.106288260 >>106288391
>>106288221
I went back through the archives and confirmed it was qwen. Sorry for doubting you. But I stand by my statements. Chroma is ass and full of nonsense.
Anonymous No.106288266
>qwen got svdquant first
where is it for wan reeeeeeeeeeeeeeee
Anonymous No.106288270
>>106288244
wholesome
Anonymous No.106288272
>>106288232
I use the weights from the lodestones repo on a 24gb card using the workflow her provided. Still is noisy mush ass nonsense.
Anonymous No.106288273 >>106288289
qwen vs wan22 for training a realistic character lora?
and best way to train a lora for them?I don't see anything on the OP
Anonymous No.106288275
>>106288244
his real treat is the toddler off camera to the left
Anonymous No.106288289 >>106288347
>>106288273
well qwen will be simpler it being a single checkpoint, but it's hungrier than wan
I would say qwen, you can always i2v it in wan anyway
Anonymous No.106288305 >>106288351
Anonymous No.106288335 >>106288351
Anonymous No.106288347 >>106288354
>>106288289
any rentry lora training guide?
Anonymous No.106288351
>>106288335
>>106288305
Quit yappin
Anonymous No.106288354
>>106288347
not that I know of
I use musubi and it has good documentation
Anonymous No.106288391
>>106288228
Yeah well, it appears to be down.

>>106288260
>Chroma is ass and full of nonsense.
Vague statements like this mean nothing. Models that aren't full of nonsense do not exist in the world of AI image gen.
Anonymous No.106288415 >>106288420
https://catbox.moe/
Oh lawd.
Anonymous No.106288420
>>106288415
https://files.catbox.moe/4ixje2.mp4
lol posted the link to catbox itself. I'm a retard.
Anonymous No.106288431 >>106288436
Anonymous No.106288436
>>106288431
I think I'm having a retard attack.
Anonymous No.106288445 >>106288457
please use litter catbox instead of files catbox
you lads are using up all the space
Anonymous No.106288457
>>106288445
Hmmm nyo~<3
Anonymous No.106288459 >>106289726
Anonymous No.106288515
Anonymous No.106288552
>>106288550
>>106288550
>>106288550
>>106288550
>>106288550
Anonymous No.106288683
>>106287913
I am having fun with 8 but it would feel like a waste to buy one with 12 now
Anonymous No.106288707
>>106287962
>>106288226
the tolerances of the new connector are so bad that it's pure luck if you get a good one or not

it is not a problem that should exist on a 2000 dollar card
Anonymous No.106289188
>>106285835
Hahaha this is the best ai video I've ever seen
Anonymous No.106289726
>>106288459
nice
Anonymous No.106290077 >>106290223
ESRGAN can't upscale my pixel art texture.

Any idea of how to do it?
Anonymous No.106290223 >>106290248
>>106290077
bruh if that's the texture in question you're probably going to need to draw a new texture in higher resolution because that's not really comprehensible
Anonymous No.106290248
>>106290223
My idea is more like making a nearest neightbor upscale in 1024.

But I wonder if I can take videogame textures from real models and make a dataset of low res pixel art downscaled versions and their HD versions.

And then make a LORA that has both low res and high res versions.