migu
md5: cdcf076f5651dd33ae3ad80836f43d4d
🔍
Discussion of Free and Open Source Text-to-Image Models
Prev:
>>105577098https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
>Models, LoRAs, & Upscalershttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
>Cookhttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>ChromaTraining: https://rentry.org/mvu52t46
>WanX (video)https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>MiscShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Archive: https://rentry.org/sdg-link
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Bakery: https://rentry.org/ldgcollage | https://www.befunky.com/create/collage/
Local Model Meta: https://rentry.org/localmodelsmeta
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/celeb+ai>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Is there a text encoder for wan that is less than 5GB?
Blessed thread of frenship
>>105582517 (OP)Is https://github.com/FizzleDorf/AniStudio better than the other UIs because it was written in C++ instead of Python?
>>105582600How does it feel knowing you will never be a woman?
>>105582581https://huggingface.co/city96/umt5-xxl-encoder-gguf/tree/main
Q6 quant should work but why? Grab a bigger one instead and, if you're so tight on vram, make it unload after encoding your prompt by attaching one of the "unload model" nodes, there's like 5 of them among custom nodes, if not more
>>105582570>Including fennec girl in collage.Why?
posing
md5: 929399a7831a399acc2c10d1e8f26f8c
🔍
>can use my own gens with good poses as control net for other images
HOLY FUCK
>>105582626I'm using google collab.
https://research.nvidia.com/labs/dir/cosmos-predict2/
slop status?
>>105582681Got any example of fun pose transfers you've done?
>>105582774https://litter.catbox.moe/nydiyv7ty2o1soxm.jpg
Actually, for some reason those stick figure depth maps had way worse performance than real images. Do I need a specialised controlnet for those?
>>105582681good job anon.
One day your autismo will let you make autistic shit.
>>105582822Cool. (But also disgusting fetish, faggot.) I've seen these weird playgrounds a lot while trying to gen Hachikuji. ControlNet is fun. Have fun.
>>105582758I'm trying out some very basic prompts for the 2B right now.
Cons:
Can't do nudity whatsoever. A woman will be wearing a bra and panties if you ask for nude. Even SD3.5 medium will do nipples and bare crotch (with no genitals) if you ask for nudity.
The license. Supposedly you must agree to some AI Ethics document thing, and must not bypass any safety guardrails? Whatever that means. Will Nvidia sue your ass if you make a porn finetune of this model? Who knows.
Pros:
Anatomy and consistency seem extremely good for a 2B model. Better than anything else in this size class, probably.
Did not seem as slopped as flux, but I will need to do side-by-side comparisons to know for sure.
>>105582903can it do feet
>>105582822>Actually, for some reason those stick figure depth maps had way worse performance than real images. Do I need a specialised controlnet for those?If you mean the OpenPose preprocessor, it only works well on live action input images. For anime input images, I stick to Depth and Canny.
>>105582915Ah, I see. There's a rank 128 and a 256. Just grab the bigger one?
hey quick questions, can my vram get ram rot if i'm constantly switching models? there was a thread on /g/ where people were talking about their ram getting rot (or smth) for being unused or overused a lot
>>105582903>>105582758the Nvidia models are obviously a worthless dead-end due to how restricted they are. I hope comfy at least got paid by Nvidia to add support for that model.
on the other hand, the Nvidia models do seem to have some good architecture and optimizations. I hope model bakers take note and incorporate their improvements.
>>105582960I don't know, I use promax with the default preprocessor models from comfyui_controlnet_aux.
file
md5: f23e8eb216a87011461be16e0ca65cb8
🔍
Finally can get a 5090 on Amazon. Then I can put this baby into the dungeon to be forgotten.
What happens if flux kontext gets an nf4 release only, meaning only 50 series cards can run it? is there a way to uncuck the model should that happen?
tinfoil hat but i feel like things are getting more and more fucky what with nvidia interference (heh).
>>105582913the most important question indeed
>>105582993>worthless dead-end due to how restricted they areWhy are they doing that, it makes no sense to me to lock down the licenses when their business is in designing and selling the chips to run the models, so their incentive should be to make these models proliferate as much as possible.
>>105582891based arararararararagi connoisseur
>>105583090my guess is they have some leftover compute to burn and make these as some kind of advertisement or proof of concept. Also, possibly this kind of model might have a market in a corporate setting, where HR doesn't want accidental dog penis or titties getting in their stock photo gens.
I realized when civit basically implodes I'm fucked because I lack creativity.
>>105582600>better than the other UIs because it was written in C++ instead of Pythonyes but also no because the space is overpopulated with script kiddie grifters that don't actually want to learn how to dev
>>105582853>Amerindianwife
Can aI output transparent pngs?
I'm using reforge along with forge coupler, because I'm trying to gen an image with two different characters, but it just turns out like shit because apparently forge coupler sucks vs. A1111 latent coupler. Is there any solution?
>no ac cuz yurop and electric bills went up by 9001%
>press gen pc vomit air at 98°C
>open a game pc vomit air at 80°C
>read a manga pc vomit air at 50°C
>30°C in my room at 8 am (yes am)
well, goodbye i guess ill turn on my pc again around october
>>105583308Sora can.
I don't think locals do, but I am not sure.
>>105583308it's a controlnet thing or just rembg
>>105583308Not out of the box certainly. The few times I needed a transparent background I've used Gimp.
Make a selection with the wand tool, grow the selection by 1px, add a new layer, set it to color erase, and then use color fill on that layer to filter out the background. If you need to automate that, you'll spend some time with python and pillow or something similar.
>>105583316Forge Coupler is indeed worse than A1111's but have you read forge coupler's github? The prompting meta with it is a bit different.
>Is there any solution?None that I know of.
The entire auto1111 derivative ecosystem is dying anyway. We will all be forced to learn trannyUI sooner or later.
IMG_2647
md5: c67fa18b1871661b4f9184177f9d98cb
🔍
>>105582517 (OP)>still no napt in neighbors listew.
im not carrying the thread this time
best of luck
smell ya later!
>>105582539if by "sad" you mean intentionally malicious and retarded for the millionth time
then ya, accurate
>>105583308>>105583308There was a thing callled "layer diffusion", but I never tested it, you should look that up.
>>105583399What's wrong with the bake?
>>105583419What do you mean?
>>105583368Yeah, I've read the github and I've been doing as it instructs, I just get shit results. It always wants to bleed the loras together, even when the regions are properly segmented, or it simply just doesn't understand the pose I'm asking.
>>105583443You said the bake is malicious and retarded. What should we do to fix it? I'm not this baker, but I can try.
>>105583455What's wrong with the bake?
We did it boys. The average normalfag and even some artfags can't tell AIart from regular shit anymore. They're even defending it. We're so back.
>>105583465I don't see a issue but I see a ton of other issues especially shilling and advertisement fails followed by crying
>>105583543What do you mean?
how the hell do i get comfyui DESKTOP to use sage?
sageattention and triton-windows both installed via the built-in terminal, added --use-sage-attention to the shortcut, restart the app and the log still says "using pytorch attention"
>>105582951why aren't you using faceid with his picture for these? are you some kind of retard?
>>105583543Literally julien wow
>>105583586I don't dox, unlike that faggot I have limits which makes his falseflags even more embarrassing when he has melties.
>be ran
>unemployed
>government housing
>contribute nothing
>stir shit all day
>obsess over people who don't even think about me
>spend all day generating pictures of my perceived enemies
>fart and shit in my diaper
>smirk, "I am a winner!"
>>105583601He doxxed someone?
>>105583613Only himself but he likes to blame people for things he puts on himself. Now he's seething despite me owning one of the most expensive gpus on the market and detailing my build process.
Amazing that he can't work smarter during his day job
>>105583543>s-she said WHAT?>well, your child is l-lying then!>s-she wanted to s-show me something a-and it was her idea i s-swear!
>105583627
16 hours every day
Ah great more projection
If you're this desperate go back to your containment and stop seething. You retards have been seething for days
uh oh ran called his discord trannies in again
Julien can you just fuck off to /sdg/ please?
>inb4 ran singular schizo anon nonsense
>>105583678It's his safety word, he's been seething for days using his cope nickname only to be told to fuck off by regulars. Ani should spend more time fixing his shit instead of ebegging for anons to work on his for profit product that he used to flaunt claiming colleges would use it and business men will buy it from him. Such a fraud retard especially with the state that fucking thing is in.
now they're jerking each other off again
>>105583567>comfyui DESKTOPelectron slop, not even once
QuantStack/Wan2.1_T2V_14B_FusionX-GGUF Q6 8 steps 109 frames
Prompt executed in 215.73 seconds
much much better than self forcing
187 seconds
i missed movement in the background like you wouldnt believe
>>105583783Most gennable miku song?
>>105583749>>105583762The weird flickering with high contrast in the first few frames is due to some incompatibility between baked-in loras. Manually applying accuvid/causvid with mps/hps loras to the base model will probably yield better results with the same amount of steps
>>105583825>The weird flickering with high contrast in the first few frames is due to some incompatibility between baked-in lorasive been getting it on normal wan and also on self forcing. i have no idea what it can be. at this point i think its the Q6 quant because I genuinely have tried every combination of optimizations and no optimizations and it keeps happening
also I need to unload and reload all the models every time or the next gen is a fucked up version of the previous gen, and trying to prompt after that gives allocation on device errors
these are all hoops worth jumping through but i have no idea why i'd be having these issues with this merge but not normal wan
https://huggingface.co/collections/nvidia/cosmos-predict2-68028efc052239369a0f2959
>Potential Known Risks: The model's output can generate all forms of images, including what may be considered toxic, offensive, or indecent.
did anyone tested that out?
I feel like lodesteone has killed chroma since V29.5 (when he started his distillation bullshit), now all my outputs are completly slopped on the newest versions, they now look like regular Flux images (plastic skin, "professional" lightining", background blur...)
>>105583915i'll also be honest famalam i'm not seeing the improvements since this thread started posting chroma gens
>>105583870i asked about this ages ago, no one gave an answer why it even happens. after like 2 gens it eats complete shit like in your image.
>>105583915>I feel like lodesteone has killed chroma since Yeah, what he did was completely retarded, why didn't he just let the training run as normal and do the optimizations after the training?
>>105583915 (You)
>I feel like lodesteone has killed chroma since V29.5Yeah, what he did was completely retarded, why didn't he just let the training run as normal and do the optimizations after the training?
>>105583879>did anyone tested that out?their previous Cosmos was ass so I'm not really hyped by that one, I'd like to see someone provide some examples though
>>105583846The best practice imo is to use two ksamplers - one with 2-3 frames at high cfg, then the 2nd one with the next 5-6 steps at 1 cfg. Doesn't eliminate it completely but does look at lot better
>>105583749Is this model a drop-in replacement or does it need other adjustments to the workflow?
>>105583732rightclick is broken in browser
>>105583995NTA but can you share a workflow?
>>105583975Wow, cool. Good job, anon. What's the prompt? I can't even tell if this is Chroma or SDXL.
>>105583989the Miku spammer cound make a Miku made of ice
>>105583970>>105583960>samefag>and also retardedAnon...
>>105584074>coundcouldn't
file
md5: f4e80d814f71d5bb4302c8529c9b5090
🔍
>>105583879https://research.nvidia.com/labs/dir/cosmos-predict2/
It doesn't look bad, but I'm surprised that model 2b doesn't look much worse than the 14b one.
>>105584088because 14b is also trash
>>105584061Sure https://files.catbox.moe/897ihu.mp4 workflow embedded
base wan i2v with loras, remove image-related stuff if you want t2v.
Loras are https://huggingface.co/Kijai/Wan2.1-Fun-Reward-LoRAs-comfy/tree/main and https://civitai.com/models/1585622?modelVersionId=1794316
I think you'll find the nsfw ones yourself
>>105584088It never looks bad until you try it out on your own.
A street - how hard.
Then they cherry pick the best ones anyway.
Similar case to this Miku faggot.
It's like asking some guy to draw a cat. Everyone knows how to draw a cat.
>>105584088they are pretty stupid models that don't accomplish much. It's mainly for the robutts to train
>>105584125>A street - how hard.it's hard because they have to render a lot of humans that are far away, usually image models can't do that
>>105584088The reality is 14B models are actually extremely undertrained despite getting all the training resources. Wan with 1.2B is not bad which makes me curious what a properly and sufficiently trained 4B model would look like.
>105584143
>schizo chimes in
>>105583879>Nvidia themselves giving us a SOTA uncensored model>Something that actually may be better than Chroma etc...Shit, not something I expected, but they know what they're doing. I'll take it. From what I've seen so far it looks as good as Mogao (but better), and it's completely uncensored...
>>105584158>Something that actually may be better than Chroma>From what I've seen so far it looks as good as Mogao (but better)Chat is it true?
>>105584157>you grazed my ego by disagreeing so you must be the schizo
What ego? This is why you are delusional.
>>105582517 (OP)Thank you for baking this thread, anon.
>>105582570Thank you for collaging the previous thread, anon.
>>105582585Thank you for blessing this thread, anon.
>>105584067its illustrious. the prompt is pretty simple so the result was mostly rng. I would try it on chroma if it did not take me 11 minutes per image with it.
oekaki, hatching \(texture\), 1girl, space, fetal position, floating, inside egg, transparent egg, star \(symbol\)
>>105584181What do you mean?
>>105584204Thanks a lot. I've just been told that Illustrious (base model) has the biggest creative range, especially for abstract images. If this is indeed IL base model this is a good case in point.
I wont pay money to test video gens.
Already scammed 4 times with this garbage tech grift.
Give me some collab I can run in google collab that can make img2vid wan.
>>105584158>26GB to test 2B>48GB to test 14BDamn, got really excited for that 14B for a sec but can't test it out myself.
We need a quant now.
error
md5: 42a67ceb9df6f88f0f39805276d22551
🔍
This tech is a scam.
>>105584245I doubt it's asking for that much VRAM, it's probably because of the text encoder or some shit, like usual I'll wait for a comfy implementation
>>105584270with virii and tracking
>>105584219ah the model itself is coco-illustrious 7.0. The base illustrious 2.0 might be able to do it but I had meant that the model was an illustrious based one.
>>105584265plz subscribe to API nodes plz tyvm ;]
>>105584306where do I get wangp model and text encoder?
>>105584355Cu-Cuh-c-cute! H-ha-pp-p-py..F-Frid-AY
>>105583933>i asked about this ages ago, no one gave an answer why it even happens. after like 2 gens it eats complete shit like in your image.weird. oh well its the best we have right now
>>105583995>The best practice imo is to use two ksamplers - one with 2-3 frames at high cfg, then the 2nd one with the next 5-6 steps at 1 cfg. Doesn't eliminate it completely but does look at lot betteri'd love to do that but dealing with the OP rentry WAN workflow and its SamplerCustomAdvanced is too spaghetti for me. do you have a workflow you can share?
i managed to make low poly on Pony in a funny of way (Lora of Caius, the old meme guy from morrowind + gta san andreas Lora). the characters might retain some facial features of Caius but i like how it looks
to pull, perchance to spend hours troubleshooting and eventually reinstall
So I learned you can give persons names in Chroma (sorry if it's old news and it works for Flux too).
>The old lady is named Ada. Ada is x, etc.
The bad news is it rarely straight up prints the name into the picture and that it seems worse for prompt bleed than directions (for example, the old lady in the middle). But if you name someone "X", for prompts it's certainly shorter than "she".
sleep
md5: 2e7b6557403cbf4190533bc94e525640
🔍
>>105584601As always you have to understand how training data is captioned. So much is based on people having wild understandings on how captions are typically done and any of this behavior is from auto captions which are based on famous people the captioning model recognized. e.g. "This is a picture of Tom Cruise at a Gala event. He is shaking hands with Bill Gates.".
>>105584601>prints the name into the pictureinstall IOPaint
yeah I'm happy with this until the next paradigm shift. this is fast enough and quality enough for all my uses
>>105584658he's federally acquired i'd say
>>105584647can you cut out the vae frying the first few frames?
>>105584631The Rich People said No - and now, any resemblance to especially rich people is Very Illegal.
That's the end of it.
>>105584658what are you gay or something?
>>105584671you know what? fine. i VILL try the double sampler suggestion that anon gave
>>105584676That's not the law but if it was it would be a good thing.
>>105584647 judging by this.
>>105584521here
>>105584123just replace wan2video with empty latent if you're using t2v
>>105584686>That's not the lawIt is the law pretty much sonny boy.
>>105584699The law is distributing sexual pictures of real people and other forms of defamation. Makes me glad to know your days of posting online are numbered though, only a matter of time until you get a perma'd for breaking US law here.
>>105584692thank you!
what's the point of the TrimVideoLatent node in between your two samplers?
>>105584750Literally me except I'm skinny as fuck
>>105584710Oh wait let me guess you are from Commifornia aren't you mr. Cockler?
>>105584710I : as an ESL is like a 2B model > ready to rock but not yet - there is too much vocal incoherence in between my desk and your ears~!
There was a time in which I could buy everythin I wanted with my card... *instrumental*
>>105584758Read somewhere on r*ddit that it somehow helps with the color flashing even if it doesn't remove any frames (like in my workflow). I'm 99.9% sure that it's placebo but I kept it just in case. Just like the color match after decoding - it was an attempt at forcing those first frames to match with the original frame's color tone. Didn't fix it completely but it did make it look less shitty. In the end, adjusting the values of causvid and hps/mps loras is what made the flickering (almost) go away
https://files.catbox.moe/e0srb7.webm I'll blame my own skill issue on the cigarette disappearing - should've described it better
>>105584853i guess i need two load model flows so teacache doesnt apply to both sampling steps
this is not worth it, I could be genning during this time
unless you'd like to be a lad and patch OP Rentry's WAN workflow to use the two samplers but also use TeaCache. I also noticed you're doing DDIM but it seems like LCM is the one that is the new hotness (for self forcing especially)
>>105584885Teacache is incompatible with those loras as far as I know. It's shit anyways, always messes with smaller details, especially the finger movement.
>>105584906>finger movementi don't expect finger movement to be good ever because of training data and motion blur. but anyways thanks for entertaining me and for your time.
>>105584885It's the /pol/ fan. Thank you - my grandfather was serving in Germany and he was working hard in uboat...
>>105585013she looks autistic
>>105585013Why she's bleaching her juden dark hair? Main Grandfather didn't die for this, algemein.
AI has destroyed my ability to guess ages
>>105585041>she looks autisticeveryone looks autistic sitting on a swing past the age of 10
502
md5: c0676252df26decf7848e2eb87c88509
🔍
Ok so openpose 2 seems to be working better with anime, but it doesn't take the images properly sometimes. Definitely usable tho.
>>105584853Causvid 1.5 and 2.0 LoRA fix the occasional flash that can occur at the start of a video that the 1.0 LoRA sometimes caused. Causvid 1.0 and 1.5 increase saturation when CFG is past 0.30 but Causvid 2.0 can be run at 1.0 without any appearance changes. I also noticed Enhance A Video may sometimes cause the first few frames to have a grey filter when using it in conjunction with Causvid so I turned that off. The last couple things I can think of that cause color shifts and or that grey filter are all the overcooked LoRAs. Sometimes rolling a new seed after lowering the strength fixes it but I think adjusting block strength might do it as well while allowing you to use the full strength of the LoRA's motion though I never got around to actually testing this since I don't use the Kijai wrapper node workflow.
>>>/b/935730844
check this shit out
>>105585194https://civitai.com/models/1528155/breast-expansion-wan-i2v
>>105585088Such a cutey, and she's roughly between 14 and 38yo, I'm a professional, trust my opinion.
holy shit the cosmos shilling is insane.
are you retards ignoring that anon that tried it on purpose? it's censored as fuck and trained on warehouse footage.
>>105585267not everyone is trying to generate porn, nigger
>>105585222>i2vno thats actually a serious crime. i just do light crimes with my wan cuties
and the anon on /b/ said it was done with frame to frame
>>105585290it doesn't understand anything pop culture or copyright related.
what are the correct sampling settings for the 14b? it doesn't work with the same ones as the 2B (runs but the image comes out as indistinct noise)
>not even "unload all models" node along with manually clicking unload all models and clear node cache fixes the OOMing after subsequent workflow runnings, requiring you to nuke the instance every time
oh my fucking god
>>105585390yep. i dont even know who to direct my anger to. why would a merge cause this kind of issue?
>>105585316Thank you Ran... Your images have most detail.
You almost double your last resolution? Wow... 1:1 very good very good
>>105585469Sorry I forgot letters because I am from Bangladesh / ESL.
>>105585316You are so beautiful...
what's the current best anime model?
>>105585088she's agefluent, 17-21 I'd guess. 16? older. 22? younger. voila.
>>105585267>trained on warehouse footage.Unironically forgot about that. Cosmoverse is fucked.
>>105585531the prompt was for a 13 year old, which is surprising because i would have thought putting them on the swings would make them go younger on average (unless there's millions of training data videos of adult girls on swings?)
>>105585529NoobAI-XL, Kris.
>>105585550Can the model even make 13 year olds in what looks like hot pants and a tight top? I remember trying to push flux when it came out, going down with the age. It eventually just jumped from a 20something to a baby (like literally a toddler), nothing inbetween. Anyways, why are you genning 13 year olds on swings?
>>105585545>>105585508>>105585469You do realize you talk about me more than 9+ hours a day right?
I don't know why I'm the focus or why your autistic ass thinks I use a avatar when all my images are different.
https://huggingface.co/lodestones/Chroma/tree/main
what's the difference between "detail-callibrated" and the normal one? is the quality actually better?
>>105585617? i was just here to enjoy your art. seems like you are spiteful despite that there are people who enjoy your art.
>>105585615WAN is smart enough to be able to do it, its just sometimes inconsistent and affected by the prompt context
its almost TOO good at going too young though..
>>105585615>Anyways, why are you genning 13 year olds on swings?i like looking at cute young girls without telling facebook or my ISP about it
>>105584601i wonder if you will find facial patterns for different names that way, as in the average Nelly gen looking distinctly different from the average Ada
>>105585617I would like to chat with you irl, we could exchange drawings and ideas. I respect you as an artist.
>>105585615>I remember trying to push flux when it came out, going down with the age. It eventually just jumped from a 20something to a baby (like literally a toddler), nothing inbetween.young teenager or even young girl is the phrase youre looking for youre welcome
>>105585147the 3d shading is really cool on this
>>105585649Haven't tried anything but the detailed version so lmk
>>105585617Why not join forces with people who can create music?
https://www.youtube.com/watch?v=S-oH2bsfe2k
>>105585742This is my offer on 777 table.
I have given as much as you have.
>>105585778I have no idea what you are talking about also I don't get out of bed for anything not paying north of 2k+
>>1055857787 means snake eyes, three 7 means you are the anti-christ.
>>105585786Oh really I guess you must have accumulated quite a portfolio by now?
>>105585795This thread is about imagegen please stop
mikubakes are pushing me back to /sdg/ where im stuck with that annoying butterfly poster
>>105585710That was just to see what the model can do. I stick to >18, give or take, heh. Every model reacts different to the term 'young girl' tho, lustify will give you something18, while zavychroma will generate an actual young girl. 'young teenager' is not something I've ever used in my gens, on purpose. There is also shit like 'tiktok' and 'influencer' for those bizarre balloon lips, eyeliner, wavy hair, bla.
>>105585788dude wtf
>>105585812I can walk on this yard without burning anyone.
I wanted to have a project with Catjak. But I don't know him personally.
https://www.youtube.com/watch?v=3TivdCrjBNg
>>105585689I don't believe there's a pattern, how parents name their kid is kinda random
>>105585390best part is that it doesn't even actually unload the vram when you click that. only ram.
Anyone able to help?
Framerates good but i have seen MUCH clearer videos. Like her eyes keep glitching etc.
I have a 4090 (laptop).
>>105585818just browse both
>>105585856every day when I wake up and check the /ldg/ collage for inspiration. imagine my disappointment when its a fucking mikubake
Ran: if you want to have a chat, leave a message. We could work together.
>>105585866they keep mikufag on a tight leash
>>105585330nm the issue seems to be with fp8 loading not working with Cosmos. when I load at full precision, it's fine. checked and 2b is broken when you run it in fp8 as well.
>>105585529noobai, plant milk, chroma, illustrious
what sampler am i supposed to use with wan gguf? does causvid need a different sampler from that?
Zuckerberg made it free for every American.
He was staring at your wife.
https://huggingface.co/tencent/Hunyuan3D-2.1
I have a good news for the anon that was waiting for this
>i'm hiding my foreskin isn't long enough
Well son, you are not a real American. You are an Israeli mother fucker.
2685131
md5: 19ed2b79f4eb0e38478d394d24cc8df5
🔍
How do I turn the control net poses into regional masks? I'd like to redo the "general environment prompt + char1 prompt + char2 prompt"
where the hell is controlnet tile for n00b
can someone explain what's the diference between WAN VACE and normal WAN?
>>105586025Please post an image of your setup.
>>105585996oh yeah it was the 2.5, I'm sorry... :(
Cosmos is completely useless for art, the outputs are somehow even more slopped than Schnell. Which is quite an impressive feat since it uses cfg and isn't a distilled model like Schnell.
>>105586041https://files.catbox.moe/9agdyq.json
I hacked this together for a one character gen
>>105586073Great, just wait for a second. I'm concentrating on a guitar solor.
>>105585617https://www.youtube.com/watch?v=0T5QaGMkhFg
Thank you for replying to me. This is my gift to you.
>>105585829sure but maybe the training data has names in it with a less equal spread
>>1055857881girl, imminent_rape
>>105586040vace has controlnets
>>105585829Sorta. Like I bet if you used "Agatha" you'd get an old timey picture.
>>105586130erm
acksully
ani a bean, not a bee
>>105586130https://www.youtube.com/watch?v=0Sn0J5INxyA
>>105586130I love you! <3
>>105586138Yeah, true, the only pattern on names is the probability of being born at a certain time (and the race I guess), but nothing else.
I forgot to ask from the youtube brainwash routine.
https://www.youtube.com/watch?v=4ntUXe5NZ5s
Oops it wasn't a mistake.
>>105586183there is some shit buried in the training data. race, certainly. old fashioned names, yes. I made some wildcards with census data, female names, uk, 1985, etc. one model maker specifically mentioned names having an effect but I forgot which one that was. something sdxl realism
>>105586179ok she is both 15 and 45 at the same time.
>tranistudio and auto faggots absolutely mogging cumfarts
comfy is losing it's edge
>>105586179is this worflow optimized? hunyuan is so heavy
>>105586245Only person using ani studio is ani because his shit is broken and slow
>>105586266she is mogging despite this, explain that
>>105586261no this was from december and took 8 minutes to gen. hunyuan is obsolete compared to wan. I'll try to do a similar prompt on WAN and see what I get
>>105586282nta but I have not heard of whatever ui you're talking about until I saw your post, and I am a pretty big imagegen nerd, so it must be very obscure indeed
https://xcancel.com/multimodalart/status/1933449438464561271
>Normalized Attention Guidance brings expressive, high quality negative prompting to FLUX, Wan 2.1 and more
Negative prompt bros we eatin good
>>105586321holy shit, negative prompts for flux would solve so many of the problems that drove me to chroma
>>105582517 (OP)>Cosmos didnt anon realize that shit was busted months and months ago?
>>105586330>holy shit, negative prompts for fluxflux doesn't have negative prompts, flux dev has cfg = 1
Why tf does my used disk space go up by "loaded model size" GBs? Are you telling me this shit loads it up additionally? Can it just not hold in ram?
its better, but doesnt have the hunyuan sovl
>>105586340New model:
https://github.com/comfyanonymous/ComfyUI/pull/8517
>>105586352anon read the post I was replying to ffs
>>105586351please understand he only implements meme models and coin-grabbing nodes now
>>105586364nah shutup, I am a comfy respecter
>>105586359>Get the vae from here: wan_2.1_vae.safetensors and put it in ComfyUI/models/vae/wtf? why did they use wan's vae? flux's vae is better and has the apache 2.0 licence
>>105586339>>105586359Attaching an image to my post is unneeded to point out how shit these gens are and how low Cums standards are for images, oldfags understand this - HOWEVER - it makes him happy (lol) so there's nothing I can really do.v
>>105586376They also released some video models with the same architecture. That means they needed to use a VAE that can do both image + video.
>>105586321So it makes the negative prompts effect better that's it?
>>105586370what makes you think im not? im just being real here
>>105586396still it's a retarded move, they should've used flux's vae for the image model
>>105586370>I am a comfy respecterhard to respect comfy when he doesn't want the good chroma implementation on comfy native
https://github.com/comfyanonymous/ComfyUI/pull/7965
>chroma shitters coming out in full force
>>105586321Look at the examples!!!
https://chendaryen.github.io/NAG.github.io/
This massively unfucks Flux, and makes negatives more accurate like NegPIP does, except it works for ALL MODELS? SDXL, Wan, SD3.5?? AND FASTER?
What if you put bad anatomy in the negatives using this on 35m??
>>105586415But then if they do that they can't use their image model as the base for their video model.
>>105586359>>105586396you already shilled this today, people tried it and it's shit like almost every fucking model you try hard shill
>>105586358hunyuan is more natural, but so poorly optimized. i found new workflows on civit. i'll check it out.
>>105586439>shill hater attacks comfyI'll allow that
>>105586434>This massively unfucks Fluxhow can flux use negative prompts at all? I thought it was a distilled model or some shit
>>105586321the brought real kino soul to flux
incredible i have tears in my eyes
>>105586398it doesn't make negs "better", it makes them work at all. you normally can't use a neg on models that use cfg 1 (if you input one it will be totally ignored).
>>105586460>you normally can't use a neg on models that use cfg 1 (if you input one it will be totally ignored).but wan is working at cfg 5 ish, so it doesn't change anything for that model?
>>105586434>10x faster on wanwtf is this magic?
>>105586321Demo: https://huggingface.co/spaces/ChenDY/NAG_FLUX.1-dev
>>105586358dude it looks like SHIT
file
md5: 25b769f46f62ba575c94cacf6c7f0a16
🔍
Not very impressed with cosmos, maybe the 14b one is good? Can't test since I'm a vramlet.
>>105586434We are so back.
1
md5: a87136b6ede65236a9dfc613c6ac808f
🔍
>>105586469>so it doesn't change anything for that model?no, it actually just works better. CFG negatives are notorious for being ineffective, hence we needed this kind of thing for SDXL models:
https://github.com/hako-mikan/sd-webui-negpip
CFG is also expensive.
Pic is from their examples, the right side is with NAG:
>SD3.5-Large, 25 steps, CFG>Prompt:A tiny astronaut hatching from an egg on the moon.>Negative prompt:Low resolution, blurry.
>>105585832adjust your prompts
use better lora
>>105586506So you're telling me that they managed to make wan work at cfg = 1? all right now that's interesting
nag comfy node where.
it does look sinteresting for wan though.
>>105586483>ZeroGPU quota reached EVERY
FUCKING
TIME
comfypls
ok this shit is borderline magic https://huggingface.co/spaces/ChenDY/NAG_wan2-1-fast
>>105586483>Demo: https://huggingface.co/spaces/ChenDY/NAG_FLUX.1-devnoice
>>105586321https://arxiv.org/abs/2505.21179
>the chinks saved us from CFG and made everything 2x fasterI kneel again Xi Jinping!
file
md5: 4b0b82f5e10a29753929cda85e7340fe
🔍
it's bretty good
>>105586598When is brother Xi gonna release local Sesame 8b or send me a state mandated bugwaifu so I can finally start learning chinese???
i will piss and shit if this turns out to be snakeoil again.
now how long do we have to wait until they release their comfyui node
>>105586598why does everything good in AI come from the chinks now
it's the same for llms
file
md5: 0173a83520f87882273749c02e460eca
🔍
>>105586614>why does everything good in AI come from the chinks nowthe westoid pigs are too busy cucking their models and getting lawsuits from disney, the chinks doesn't give a fuck and want a good product to win the most important race of the 21th century
https://edition.cnn.com/2025/06/11/tech/disney-universal-midjourney-ai-copyright-lawsuit
>>105586321SDXL for us vramlets please?
>>105586614because nvidia is raping the west and makes it impossible to experiment outside of a cucked corporation but I'm sure there's some universities with projects cooking that can be interesting
>>105586645>because nvidia is raping the westNvdia is an US company lol
>>105586653Yes and if you want any their compute and not pay hundreds of thousands (or millions) of dollars you have to dance for them whereas the Chinese seems to have their government sponsoring them to be as disruptive as possible. Nvidia is a horrible company just like Disney and they have decided to go down the route of fucking everyone in the ass rather than creating a sustainable 30 year strategy.
>>105585966answered myself, works with the wan merged with causvid and whatever else
ddim/ddim_uniform
euler/beta
>>105586321Ok that's good, when ComfyUi?
>>105586693when anistudio?
>>105586434>This visual illusion art is both surprising and delightful, blending everyday objects with creative craftsmanship for an engaging and fun reveal.why do these esl mongoloids always train shit with prompts like this
>>105586733they used a llm to make that prompt lol