collage
md5: 979e4b0e13ccf33b3391cbb253534d1b
🔍
Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>105895325https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows/home
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>ChromaTraining: https://rentry.org/mvu52t46
>Illustrious1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/celeb+ai>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
comfy should be dragged out on the street and shot
>>105902251Comfy is one of the best things happened in AI.
A revolution made by one humble guy.
>>105902244 (OP)>only one pepewe demand more pepes in the collage
123
md5: f6eee6f101bd5c827eebf2bf92601596
🔍
>>105902268I love this man. He does seem to have a bit of an accent. Where is he from?
Blessed thread of frenship
Fed previous threads trough Kimi2, prompted for style of HP Lovecraft and then put trough Chatterbox. https://vocaroo.com/1rh1s1gYVRap
vramlets - when will they ever learn?
haven't seen debo in awhile. what happened to him?
>>105902389Now this is shitposting, impressive.
>>105902404Fuck vramlets. If you have less than 16gb vram you need to fuck off.
>>105902476Why do animated characters never shut up? Feels like most gens always have the chars talking or singing
>>105902484because whoever trained the model used videos of characters talking.
00164
md5: 117a5680f282d55a7bdee047844ed00d
🔍
>>105902500Wondering if there's a way to make them shut up
Because i have that problem all the time in my gens
If I have a bunch of video prompts queued, and one video starts generating while I am on a different workflow, will it continue using it's original prompt from the workflow it belongs to? I ask because I see nodes activating on the currently open workflow, even though this video isn't the one being generated
Anyone have any tips for batch generation in ComfyUI? Like I wanna have it run a list of prompts and dump outputs in a folder
>>105902679mouth movement/talking/etc in the negatives
>>105902655im a big fan of abstract work. we need more gens like this
>>105902851>mouth movementWon't this limit their expression too? I don't want them to talk but they should still be capable of changing their mouth expression dynamically.
beginner comfyui user here, been on forgeui for a little while, was forced to switch cos my forge randomly stopped working and was unfixable.
but liking how much faster (and accurate?) comfy has been so far!
if anyone has general wuick advice or must have nodes or such please let me know
>>105902860depends. add more strength to the positive prompt of a particular mouth expression you want
>>105902835Yes, the browser just sends workflow file to the backend to execute.
>>105902865>was forced to switch cos my forge randomly stopped working and was unfixableIf you couldn't fix forge you definitely won't survive using comfy.
What is the final redpill on FramePack? Was it worth abandoning Forge for?
>>105902716>>105902476Fuck off with your coombait.
Can I get some settings for styles pwetty pwease?
>>105902891Who the fuck still uses that outdated shit? It's a hunyuan fine-tune, and hunyuan is already old news.
>>105902244 (OP)Why is FusionX still recommended in the wan guide?
>>105902922because vramlets.
>>105902883no it was literally unsalvageable lol. i did a whole reinstallation from scratch and it didn't help either. pc just decided that forge was constantly low on vram for no reason at all when it wasn't even being half used
>>105902899i don't get it, why does this thread bully people posting gens?
>>105902899Retard. Gooning is the only reason to learn to ai gen. It's both what sells and is fun to look at. Nobody cares about artsy boring sfw shit.
>>105902899>>105903059>fuck off with [thing I don't like]You both are redditors.
>>105903108You misquoted me. I was only making fun of him.
>>105903059Gooning is what got me into home servers. Gooning is what got me into programming. Gooning is what got me into AI. I'm now interested in learning more so I can contribute to open source AI projects, in hopes of expanding my goon capabilities.
>80 deleted posts
What happened in the previous thread?
>>105902716Nice expressions, shame about all the hands being fucked up. What's the model/loras?
>>105903213>What's the model/loras?https://civitai.com/models/1626197?modelVersionId=1852433
>>105903168Shitposting Americans happened.
where is the sequel to this?
What are you anons hopes and expectations for Wan 2.2 (which we know nothing about other than the fact it will be released)?
My hopes:
- Better prompt adherence (by far the most important thing)
- Longer videos than 5 sec
- Performance improvements
- All in one video model (the same model does both 720p and 480p, as well as T2I, T2V and I2V)
Expectations:
- Meme shit like adding audio to the videos like veo 3 (chinks like to copy the fancy stuff big tech do instead of improving the fundamentals)
now that kontext is over and done with, what are next?
>>105903459illustrious 3.5 vpred
>>105903459ponyv7 is releasing in 2 weeks
chroma v50 will be releasing in a few weeks
wan 2.2 will be releasing in a few weeks/months
radial attention will be supported in a few months
>>105903476We are never getting a local release for that.
>>105903482> ponyv7 is releasing in 2 weeksif it's still based on auraflow it's gonna be doa. the model is slow as balls and really shit to train loras for.
>>105903482>chroma v50 will be releasing in a few weeksWon't that take a full month or so per epoch when lodestone starts training the final two epochs in 1024p?
As far as I remember the timeline for v50 is October
>>105903494It is, and yeah i'm not looking forward to it. I know some people might though, so it deserves a mention.
>>105903504Yes I nearly forgot. Eitherway, there's plenty to look forward to. Exciting times for the local scene
>>105903405>can't even shut up while gooning
>>105903636This is what (((they))) don't want you to know
these wan anime finetunes suck, there is barely a difference using base wan. too much 3dpd in the datasets just fucks everything
>>105903815>there is barely a difference using base wanThere is clearly a difference. Base wan is nowhere near as dynamic and not trained for nsfw.
I'm looking at some things on the Wan 2.1 repo and seeing some stuff that I don't see often in guides:
1) They say you should prompt in chinese because the model was trained on image to text pairs in chinese. this feels easy enough with google translate or whatever.
2) They mention doing "prompt extension" a ton as THE way to get a good result from Wan. I'm a comfy noob, can someone show me how to use this in comfy with qwen?
this is the guide I've been using. I've got stuff genning but am having a hard time with good prompt adherence (like removing clothes)
https://rentry.org/wan21kjguide
Can I use distances in metres in my prompt or does AI have no clue about that?
>>105903645my body when I'm trying to get a good night's rest
>>105903832>like removing clothesif you mean dynamically removing clothes, like a video of a character declothing, I highly doubt it's possible to get a good result.
I'm dense and ignorant. Where can I find the latest release of Chroma?
>>105903832>removing clothesyou are using loras right? if not you'll have a bad time
https://huggingface.co/dnad244/wan_random_loras/tree/main
>>105903855ive got to boobs, now trying panties
>>105903871not yet - was trying to hold off till I got to loras but maybe its necessary?
>>105903831just use a lora instead of downloading another 15gb checkpoint. I think you might be retarded if you think it's more dynamic
>>105903889it's absolutely necessary unless you want to waste your time
>>105903906>loraToo limited
>I think you might be retarded if you think it's more dynamicI've compared several dozen instances using the same base image you laughable idiot. There is absolutely no question aniwan is more dynamic and animated.
>>105903922it's the same 3d-esque slop as wan retard. it's not convincingly anime. the subjects have morphing anatomy too. just looks like the realistic and 3d data are fucking everything over
Is Flux supposed to be so slow or is it just the Forge implementation? Trying it out on a 5090, it takes 67 seconds for one base res gen on the full Dev version and 35 seconds on flux1-dev-bnb-nf4-v2.
the torrent from the last thread
1337x removed it and demoted me from uploader position
wtf
keep it seeded lads
I will never remove it from my list
>>105903935sounds about right. welcome to why people don't use flux often
>>105903933Cry more idiot. You have no clue what you are talking about.
>>105903954are those tears anon? why be such a cry-baby when you hear the truth?
>>105903950Yeah kinda not worth it to me, back to anime slopping I go.
>>105903962No need to project your insecurities, idiot. It's clear you don't like it when people don't see things your way. I'm happy to keep correctly labeling you what you are: a moron :)
>>105903935catbox an image and I'll compare
>>105903968>No need to project your insecuritiesthen why are you? we all want an anime video model that's good but I can't excuse this slop job
>>105903983You're on your own here, idiot. Nothing's perfect but aniwan is far better than your garbage loras lmao. You are terrible at training.
I'm trying to create a Flux Kontext nudify lora (an actually good one). To make the dataset, I am taking images of nude women in various poses, and prompting the model with "The woman is standing. She is wearing [insert outfit here]."
I will automate this process so I can generate a dataset on hundreds or even thousands of images. Now, the problem is that ~30% of the time, the model makes no changes whatsoever to the image, literally just copy pastes it pixel by pixel. How do I avoid this? It is preventing the automation. With a 30% failure rate, manual review and rerunning things will be very time consuming.
>>105904017this is actually against the eula and bghira will do the needful
Anything new on the realism coomer front or are lustify and bigASP merges still the GOAT
changed F to T with kontext xoxo
>>105904079There should be new versions of lustify and bigasp coming if they havent changed their plans
>>105903906>loraNone of them do what I want to make videos of.
There needs to be much more activity in this space but there are too many vramlets.
>>105903978I want to compare with Comfy.
Literally just 1girl, standing at 832x1216, Euler, Simple, 30 steps.
>>105904129yeah but they're just continuing finetuning them, not doing a new version based on NAI or illustrous or whatever right? XL models start to feel a bit limited
>>105903459there's landiff capable of long vid gens but hasn't been updated in 2 month https://github.com/LanDiff/LanDiff
someone train a wan lora for Boutine
https://www.instagram.com/boutinelababes/
what are the actual advantages between comfyui and forge?
I just got a 5090 laptop and started using comfy for video. it took a while to get used to the interface, but now that I'm using it it's not too bad (coming from automatic a few years ago)
forgefags claim huge speedups but I'm not a ramlette so I have no clue if this is true, and maybe they are just referring to automatic
>>105904079Can you name specific merges that are good? I'm looking for some good realistic porn model myself.
>>105904283Haha damn communists
>>105904308Biglust is okay but those bigASP merges are all pretty similar imo
>>105904153NTA, initial loading+first gen takes <30 sec on comfyui, subsequent gens take ~11 sec on my 5090FE with those setting (flux dev).
>picrel
>>105904182No both are completely new versions
>>105904151there is too much shit software as well. we expect the models to do everything automatically but most models are shit and need some sort of tooling to guide it
>>105904307forge has better img2img support options. comfy has optimizations that will regularly break itself so you have to constantly go through a humiliation ritual to stay current
What's the best way to generate chinese prompts for wan2.1?
>>105904307>>105904433should have asked too, is there any possibility of video diffusion on anything other than comfy in the near future?
>>105904509https://github.com/deepbeepmeep/Wan2GP
>>105904518>can't load custom checkpoints
>>105904307forge is a slow, buggy piece of shit that doesn't support most models. has tons of snake oil extensions built in. but the UI makes some things easier than comfy.
I used a1111 and forge for years but gave up after many breaking changes,crashes, etc. have never had any such issues with comfy.
Can anyone go into more detail on the speed optimizations for wan in the rentry guide? Specifically this part:
>fp16_fast : remove --fast from run_nvidia_gpu.bat.
>Sage Attention : remove --use-sage-attention from run_nvidia_gpu.bat
>AdaptiveGuidance : set the AdaptiveGuidance node to a threshold of 1
>Torch Compile : right click on the TorchCompileModelWanVideo node and click Bypass
>TeaCache : right click the TeaCache node and click Bypass
How do each of these things affect the quality of the gen? Like what kind of specific effect will --fast and Torch Compile have on the final result? And how much time in % is each one likely to save by being on? I know there's a node comment saying Torch Compile is a 30% speed boost.
>>105904420>we expect the models to do everything automaticallyOnly a retard would expect this, this tech is in its infancy
thinking
md5: 7fb21231903cd314dc4e314e7212c11e
🔍
>>105902244 (OP)Is it possible to quantize the weights of vae? I understand it is already a small model in most cases, but I am wondering if it is theoretically possible to do so. Without severely impacting quality of the decoded image.
>>105904358Thanks, guess it's time to move to Comfy (sigh).
>>105904618funny you should ask. i'm doing a comparison at the moment
>>105904528>he can't change the name of a finetune to trick it into loading
>>105902244 (OP)I really, really, really like this image
Do vramlets have any hope of generating good videos?
>>105904953Yes. Just wait TM without hacky solutions like distilled turbo vaes, quality reducing optimizations like --fast, etc.
Good video gen is doable on my 12gb card. It just takes fucking forever.
>>105904307USE
SWARM
NIGGA
Literally all the benefits of built in comfy but with a solid base ui on top of it
>>105905070>glowie splash page
>>105905053>It just takes fucking forever.The whole point of making it fast is so you can gen more frequently. The more you gen, the good gens you get.
in a previous thread someone was asking why people are still using 1, 1.5, sdxl and pony as it is considered "slop" and only poor thirdies use it.
I think I have struggled with illustrious for a month trying to get good eyes, tits and puss and unless I make a super convoluted workfow, inpaint and Adetailer, it's barely passable.
Compare this to 1.5 + lora, DONE.
I don't understand how after a year, we've gone backwards with regular gens.
>>105905070> bloatware> botnetyeah sure buddy.
>>105904391Ate they still XL models or what are they gonna use as a base?
>>105905100I would personally try my odds with a few proper gens than many quickly shat out garbage.
These fast methods typically do not give you good gens to pick from.
If you want fast gens that also have high chance of being good, you need to spend money.
what should the strength be for video loras where you're trying to emulate a gesture?
xx
md5: 49a90798713c234a37a99bf04ad5bd08
🔍
>>105905119>I think I have struggled with illustrious for a month trying to get good eyes, tits and pussI don't even remember the last time I had issues with this, unless you're trying to use some unique artstyle which the model has trouble with?
Enough of the lies!
For those who are new to this hobby and are reading this, ignore the positive shilling of ComfyUI.
About a year ago, the UI started to bloat and use API nodes.
ComfyUI is only good for Flux and Videogen, but who has 24 GB and 96 GB of VRAM for video generation? Obviously, then, the API nodes will start to have meaning.
Don't waste your time on ComfyUI.
Loading GTA 6 on a PC is as stressful as loading ComfyUI. However, with Forge or Reforge, it would be like opening MS Paint or CMD.
>>105905369The fact we have dedicated anti-shills like this is concerning. ComfyUI is making someone very upset, which means it's good and works.
I know popular things get an equal amount of haters, but this is sad.
>>105905369>but who has 24 GBI do and had so for like 5 years now.
first I had a 3090, then a 4090 and now I'm looking to buy a 5090 (they are still hard to find where I live).
why cant you afford a good GPU and why are you even bothering with AI when you cant afford one?
maybe get a hobby for poor people, you'd be much happier.
>>105904358Okay, I tried Comfy and confirmed that it's much faster there, but your speeds are much better, I wonder why. For me, the first gen took 40 seconds and the subsequent gens all take between 16 and 17.5 seconds. I tried replicating your settings (literally the first time using Comfy) and everything seems right? https://litter.catbox.moe/v47aeibx0bq2j39i.png The gpu and vram usage are maxed out according to Afterburner, so it doesn't look like it's under-utilizing the gpu. I usually have my gpu undevolted, but also tried genning with and without the undervolt and the results are more or less the same. Any idea why yours is so much faster?
i'm not sure why i thought changing the optimization settings would create a totally different video. i guess i read long ago that changing the step count on a given seed would do that, and then my brain just mixed it up. anyway, left is default /ldg/ workflow with the 720p model and modelsamplingsd3 set to 5. on the right since i disabled tea cache i had to use the SkipLayerGuidanceDiTSimple node that was recently added, with double layers set to 9 and single layers left empty, same start and end percent
raw video: https://files.catbox.moe/mc0nho.mp4
If a Lora on Civitai has a base model of "SD 1.5", does that mean it can only be used with SD 1.5? And if so, where is the download?
>>105905369Enough of the lies!
For those who are new to this hobby and are reading this, ignore the positive shilling of generative AI.
About a year ago, my C: drive started to bloat and run out of space.
Generative AI is only good for gooning and taking up space, but who has 1 TB and 4 TB of storage for all of your gens? Obviously, then, the Onedrive will start to have meaning.
Don't waste your time on generative AI.
Loading GTA 6 on a PC is as stressful as genning. However, with MS Paint or Photoshop, it would be like opening Notepad.
>>105905448your disk read and RAM speeds might be slower. the initial prompt processing also takes time, then subsequent gens using the same prompt are faster (see the 28s followed by 10s gens). they might also be using --fast or --use-sage-attention or on a different version of pytorch. linux is also faster than windows, generally.
>>105905369ForgeUI is still the best overall, comfy is only if you want to tinker with and finetune workflows and/or if you have autism. In 99% of cases Forge does the same as comfy while being far less annoying
>>105905598forge is abandonware
>>105905563>If a Lora on Civitai has a base model of "SD 1.5", does that mean it can only be used with SD 1.5?Yes, technically no though. Using loras not trained on the base model you use don't tend to work well. The reason it works in some cases is because the base model was already trained on the SD1.5 lora.
>And if so, where is the download?The download for what, the lora or SD 1.5? You'd need to search for SD1.5 checkpoints. There's dozens of fine tunes and shitmixes.
happening
https://huggingface.co/lightx2v/Wan2.1-T2V-14B-Lightx2v
>>105905119I struggled with IL for one day before understanding I had to use specific image dimensions.
Nowadays, you have a shit ton of good IL-based checkpoints that make it even more brainless to use.
It’s still my go to workflow for anime NSFW.
>>105905601It got Chroma support a few weeks ago
>>105905615is it lightx2v but a checkpoint instead of a lora?
>>105905598>ForgeUI is still the best overallI used Forge and absolutely hated it. I would have dropped AI completely if that's all I had to use. As a creative person, the static tabbed design limited my ability to express modular workflow designs. With ComfyUI, I feel right at home. If you like that particular UI, then I respect your preference, but to make a blanket statement that it's "the best" is being duplicitous.
>>105905355Try:
https://civitai.com/models/24149?modelVersionId=1151831
and this:
https://civitai.com/models/24149?modelVersionId=1540184
The only times the faces are ok are upon closeup, but anything full frame body is a mess.
And it's weird because any other model doesn't do this at all.
>>105905619is 1024 the problem? >.<
>>105905634i think its like fusionX except without the shitty loras fused in.
>>105905266It depends on each individual lora, rest of your prompt, whether you use other loras, etc.
No one can give you a one size fits all answer.
Check what strength other people using the same lora use on civit.
>>105905677unfortunately they tend to omit the lora weights
>>105905686Download the videos and read the metadata.
You should be able to find it unless explicitly pruned.
can you do matrices in comfyui somehow?
>>105905482why are the compression and channels in EmptyLatentImage hardcoded, explain yourself
>>105905070legit question what is the advantage to using swarm? isn't it just a wrapper?
>>105905780You don't have to deal with autism spaghetti
>>105905800why people always complain about that?
I started as a complete noob and it took me like an hour to figure out how comfy works.
and I wouldnt say I'm a very smart person to begin with. so how fucking retarded do you have to be to struggle in learning comfyUI?
>>105905827It's fucking annoying
>>105905827the real autistic people cant deal with noodles or open ended designs. they cant focus. they need everything to be structured the exact same way, all the time.
maou
md5: 0aef625d03ec9ad476a0f9d9abfef1d0
🔍
>>105905668I just used Mistoon_Anime v1.0 XL. It works fine. I don't have any of the issues you mentioned with any XL checkpoint I use.
>is 1024 the problem? >.<No, because I use 1024x1024 quite often without any issues either.
It would be helpful if you posted your workflow or exact prompt + loras you used.
>>105905827you have autism
How the fuck do you use Krita?
You can in theory paint a bunch of colours, you make a sketch and that is enough for getting guided generations but it's not doing anything at all.
>>105905575My SSD and RAM should be speedy enough, it's a new build. I'm on Windows 11, so I guess if he's on Linux maybe that contributes. But the difference is so big it got me worried... Gonna try running it with these args next.
>>105905863>>105905900Mixed signals getting here.
>>105905854>>105905827Not my fault you can't make a non shitty UI and a program that doesn't break every 10 seconds.
>>105905938t. tinkertranny
>>105905937you'll never get a real answer. just use what you're comfortable with. you won't get any brownie points for using forge or comfy. nobody cares
>>105905938works fine on my machine and I never had issues with comfyUI.
>>105905704i'm trying it on various vids i've downloaded and the majority don't have the metadata. it's tough
>>105905938Install the portable version.
>>105905884I should note I use an eye detailer(eyeful v2 bbox), but heck, even my base gen had near perfect eyes anyway.
>>105905962All I need to know whether it is for autistics or not
>>105906006>anus detailer
>>105905448Not entirely sure (I'm not an expert, just started a month ago) but try following the WanX (video) guide in the OP as well as it help you install some optimizations like SageAttention/Triton/PyTorch which I don't think is included by default. The .bat auto installer in the guide does install the old version of SageAttention though (SageAttention 2.2 aka SageAttention2++ is the latest which supposedly improves the speed on RTX 40xx (sm89) and 50xx (sm120) GPUs, which is what I have installed).
>>105906019someone can argue either ui is 'for autists'. it's all baseless shitposting.
>>105905884you don't need to post results but try this with mostoonAnimeNoobai:
>A1111 weight interpretationmasterpiece,
1girl,
solo,
high quality,
(ultra highres),
teenage girl,
(((large breasts))),
(((looking at viewer))),
(((body facing viewer))),
green eyes,
red hair,
medium straight hair, (((shoulder length))), simple hairstyle, parted bangs,
woman,
slim,
medium breasts,
happy,
naked,
nsfw,
flat colors,
full body shot,
in frame,
intricate details,
detailed eyes,
detailed face,
stunning body,
(((simple background))),
white background
Negatives:
(((X-Ray, xray))), ((long neck)), ((black and white, b&w)), (DoF), (blurred), (bokeh), (speech bubbles), chromatic aberration, deformed body, ugly face, extra arms, watercolor, sepia, worst quality, low quality, lowres, poorly drawn face, bad anatomy, blurry, watermark, signature, ugly, artifacts, bad image, anime, tail, ponytail, armpit hair
Euler A/Normal
CFG 7
Steps 35
seed 707886075805066
Control net name is nobaiXLControlnet_openposeModel > set union type openpose > apply control net str=1, start%=0 and end%=0.35
The reason I used that controlnet is to get a fullbody pose as it won't otherwise, it stops at the navel.
The result I get is ok until you check out the eyes, which are messy, the nipples are these super faded shadows and the pussy is this super thin and unfocused slit. I can't describe it.
>>105905405Overall, Comfy is the best inference solution, it supports practically everything and it is updated very often.
Forge was a challenger, but now it's practically in maintenance mode, although it got Chroma suport recently.
Swarm is bloat wrapping around Comfy, pointless.
InvokeAI is proprietary with a crippled freeware version and slow to support new stuff, still if you are deep into inpainting, it is the best tool.
Comfy could be much better, so many simple things require third-party-quickly-deprecated extensions, but beggars can't be choosers, Comfy is the best we have.
>>105905780good inpainting interface
great file library management system
presets
a lot of stuff baked into the base interface like autoseg
can switch between the ui and comfy on a fly
>>105906060Don't listen to him, forge is better than comfyUI.
Comfy has the linuxtard approach where you can do 1000 useless tasks no one asked for and no one uses, but fails in the most basic stuff like having a dragging system that works, not breaking every second and a """"comfortable UI""""" designed for human beings.
That program was made by an antisocial autist and it shows.
>>105906001Just grab a random value like 0.7 and increase or decrease it depending on how good it is working at this point.
>>105906029Thanks, I'll try following the guide. Did you make any optimization on top of it?
>>105906060If Comfy just had a proper gui
>>105903935thats why vramlets cope with distilled or some fast variations that look like shit
just use chroma instead, 20-40s per gen 30 steps but great for realism
>>105906025>like having a dragging system that worksThe dragging works fine.
>not breaking every secondCustom nodes breaking isn't Comfy's fault. The UI undergoes significant improvements and changes, so it's only natural the api/backend changes to make things more optimized, streamlined and sane. This is normal.
If you have a particular grievance or bug, open a PR. I've linked both the frontend and backend for your convenience, Anon.
https://github.com/Comfy-Org/ComfyUI_frontend/issues
https://github.com/comfyanonymous/ComfyUI/issues
I'm sure the team would be delighted to hear the alleged problems you have. I'll be looking forward to your post.
>>105905100just gen at full quality but stop after 3-5 steps if you see it went to shit
>>105905369>ComfyUI is only good for Flux and Videogen, but who has 24 GB and 96 GB of VRAM for video generation?you need 16gb vram and 64gb ram and you can gen at max quality for wan without ever even swapping anything to ssd, if you dont have at least this 400$ worth of hardware in your pc maybe the bleeding edge tech isnt for you?
>>105901308Increase epoch while reducing repeats.
>>105905827I've been using comfy for two years and I'd rather just click across menus than drag the fucking workflow around every time ai want to change domething
>>105905914just follow some youtube guide
>>105906126Just the installation of SageAttention 2.2, replace the following line in the .bat autoinstaller that the guide provides;
%PYTHON% -s -m pip install sageattention==1.0.6
with
%PYTHON% -s -m pip install https://github.com/woct0rdho/SageAttention/releases/download/v2.2.0-windows.post1/sageattention-2.2.0+cu128torch2.7.1.post1-cp39-abi3-win_amd64.whl
>>105906029How did you update the old version of SageAttention the bat file installs to 2.2?
>>105906240sage att has no impact on quality
fp16 supposedly does but i haven't noticed any real quality loss
adaptive guidance sacrifices a bit of quality for speed up to the threshold value
not sure on compile/teacache. unless you have an H100, you really shouldnt be disabling them anyway because they all have acceptable quality loss.
>>105904017You need to train the text encoder for the Flux lora to work more consistently, even if it's only the L clip.
>>105906192why does it do a mouse click when I let go of the mouse? i mean, if i click to drag the workflow and then let go it will click on whatever the cursor is hovering on. many times i have cancelled gens by accident because i was dragging the workflow and let go of the mouse while it was hovering over the red X. what is up with that?
>>105906192please just fucking fuck off shill nigger yoland
>>105906277>rather just click across menus than drag the fucking workflow around every time ai want to change domethingBro, what are you even doing? Do you not know the Bookmark (rgthree) node exists? You set a hotkey and it zooms in on that portion of the workflow. I never have to drag anything.
For example
Number 1 = setup
Number 2 = samplers
Number 3 = previews
etc
>>105906325>i mean, if i click to drag the workflow and then let go it will click on whatever the cursor is hovering on.uhh what. i use the middle mouse to drag the workflow and that doesnt happen?
The only issue I've had with ComfyUI is that it's fucking stupid that you have to download 1 gorillion custom nodes to get things that should just be core features. And that doing so breaks shit every time you have to update.
ComfyUI has no hires fix button. That makes it worthless.
>>105906374oh, i've been left-clicking. still, it shouldn't do that
why did comfy hire mcmonkey if hes not going to include swarmui stuff in mainline comfyui? wtaf
>>105906389I have 114 custom nodes installed and do bleeding edge updates. Haven't had anything break in months. Even when it does, the node devs are usually on it in a matter of hours.
>>105878823>litterreup pls
>>105906462magnet:?xt=urn:btih:1EC7CFF7B831111C15589E16C806C94649B5DCC9&dn=Stable+Diffusion+FLUX.1+Kontext+%5Bdev%5D+removed+LoRA+collection&tr=udp%3A%2F%2Ftracker.leechers-paradise.org%3A6969%2Fannounce&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce&tr=http%3A%2F%2Ftracker.openbittorrent.com%3A80%2Fannounce&tr=udp%3A%2F%2Fopentracker.i2p.rocks%3A6969%2Fannounce&tr=udp%3A%2F%2Ftracker.internetwarriors.net%3A1337%2Fannounce&tr=udp%3A%2F%2Ftracker.leechers-paradise.org%3A6969%2Fannounce&tr=udp%3A%2F%2Fcoppersurfer.tk%3A6969%2Fannounce&tr=udp%3A%2F%2Ftracker.zer0day.to%3A1337%2Fannounce
>>105906347didn't that dev leave because comfy wouldn't implement qol despite having the code right there on a silver platter?
Anon do you have some info to share about models that do not work on diffusion but reconstruct the image pixel by pixel? Any link to a paper/repo/thread? I forgot to bookmark any of this shit
pls anon :>
>>105906500I think you're thinking of cubiq, rgthree nodes were updated just a few days ago.
>>105906423mcmonkey is with ani's company, not comfyorg
>>105906514when is he just going to give up like him?
>>105906558It understands english captions perfectly fine, why would you want to use chinese ?
>>105906558i think you're wasting your time. i don't expect any translator to work too well for chinese
>>105904458>>105906558unless you are a native chinese speaker the translation will lose out more information than any miniscule improvement from writing the same prompt in chinese
>>105906192The dragging doesn't work fine. If you make a node UI the first thing would be to ensure dragging and clicking has no errors. And the UI is a mess, you can't identify anything there
>>105906449>I have downloaded 114 nodes and see no problemsThis is exactly the problem.
You can't even do simple operations in a node UI without downloading 100 extra tools.
ComfyUI is not comfy
>>105906719All those custom nodes I have are for personal niche things I want to do. I don't think they should all be in core comfy. My main workflow only uses maybe 10. WAN workflow uses 10. Post processing workflow uses maybe 20. Others are experimental niche case things.
>>105906785again, you are making cope excuses. there are nodes that should be ootb yet they would rather make api nodes
>kontext can remove watermarks on all stock images
istockphoto BTFO
>>105906900Yeah and it's much easier than doing it manually
>>105906882we just got sub graphs, a very powerful feature. and despite your hate, supporting api nodes was important. cloud models like veo3 are obviously ahead of local models, so being able to use comfy as a ui to interface with them keeps comfy relevant across the ai space.
>>105906900yeah it's pretty good for that
>>105906932man, shit the fuck up shill. sub graphs should not have taken years to get in. it should have been there on fucking release. i don't know who the fuck they hired but 90% of the comfyorg team is made up of fucking retards
>>105906943also if you replace text it can replicate the font, even if the original letters aren't present. or re-pose people or make them do funny stuff. it's a great edit/manipulation tool.
>>105906947>complains about shilling there are anons here that constantly bring up how forge is better>will never see this anon call a forge praiser a shilllol
>sub graphs should not have taken years to get inwell, it's here now, so what exactly is your problem? its like you trying hard to find reasons to be angry. i dont get it
>>105906990forge is free and is spread by word of mouth by users. comfy hires shills to advertise in organically on a Mongolian basket weaving forum.again you are defending incompetent priority making of the devs. I use legacy as well because they bloated the ever loving shit out of the shill frontend that hogs respurces
>>105906990there is no subgraphs in legacy btw
>>105907014>forge is free and is spread by word of mouth by users.comfy is also free and spread by word of mouth y users who post in this very general.
>comfy hires shills to advertise in organically on a Mongolian basket weaving forumdo you have solid proof of this?
>again you are defending incompetent priority makingI'm not saying it's perfect, but some of you anons making extreme comments like this
>>105902251 has me concerned. it's not normal to hate software you don't have to use that much. something doesn't add up. it's almost like the hate is personal, but that's just my observation.
we getting a dedicated i2v lightx2v checkpoint or lora or something?
https://huggingface.co/lightx2v/Wan2.1-I2V-14B-720P-Lightx2v
from: xbox graphics is hiring
>>105906990I use reforge for noobai/illu anime gens, it's way faster for using tag autocomplete, civitai helper for lora management, adetailer, reactor, controlnets, etc.
but for video and kontext comfy is fine. once you have a good workflow you can just reuse it.
>>105903411I expect it to be the same as wan 2.1, but slightly better.
>>105907091you keep saying it's faster but that's extremely vague. what is faster? ui response? generations? inference speed? that tells me almost nothing. comfy is fast for me. tag autocomplete from pysss works fine, lora manager by willmiao is MILES better than civitai helper. It even has a chrome extension so you can seamlessly queue loras for download. adetailer already exists as a node, controlnets work fine as nodes, etc. there is absolutely nothing I need from forge.
ai "art" isn't real art you faggots
>>105907142that's an awful amount of loli, sir. mind explaining yourself
>>105907142civitai helper is great for lora management and I can do controlnet stuff or adetailer or reactor without needing to draw a bunch of nodes/connections.
comfy is fine when you have a set, static workflow.
>>105907061if comfy was good there wouldn't be any valid complaints. you spend most of your time arguing with people that already made up their minds. the future is not comfy. get over it
>>105907181>civitai helper is great for lora managementSo is Lora Manager, hence the name.
>I can do controlnet stuff or adetailer or reactor without needing to draw a bunch of nodes/connections.It takes all of 5 seconds to connect the noodles.
>comfy is fine when you have a set, static workflow.That ideally should be the end result of all workflows. Once everything is connected it's a matter of changing a few parameters to get the results you want. I don't think a 'dynamic workflow' is even a thing.
>>105907188>if comfy was good there wouldn't be any valid complaints.What kind of logic is that? Krita has valid complaints. Photoshop has valid complaints. Blender has valid complaints. Literally every software ever created has a small minority of users that hate them, which is why alternatives exist. Does that mean all those software are bad? No. There will never be a unicorn UI that pleases everyone.
lora: https://tensor.art/models/852362314505033772
finetune\checkpoint: https://tensor.art/models/864231482397327022
>>105907259krita has a tranny problem, photoshop is a victim of corpo enshitification and blender's licence sucks ass so not many people are willing to contribute because they can't reuse the code to make money implementing it into studio tools. every software has problems, shitty software has retarded problems like
>>105906325 existing since it came out years ago. comfyui is just a way to string scripts together from people who have never used art software before. THAT is the fucking issue and why so many people complain. it also crashes all the fucking time because the "ecosystem" is full of volatile deps or is just a grift
>>105906558Just use google translate.
>another episode of anistudio being perma butthurt comfy threw him in the trash can
>>105907234If I want to do controlnet stuff in reforge I just tick a box, or two. or for adetailer. it's just simpler for doing anime gens.
comfy is fine for wan 2.1 and kontext though. but if I wanted to add functionality other than a save image node, then I need a rocket science manual to figure out what nodes I need and what connections.
>>105907303try not to delete this post for no reason this time kek
>>105904528>>105904797its not the checkpoint that is the issue for me
but i have issues fiddling w\ offloading additional LORA to system ram
i am a vramlet these days;
in these unknown days
in these final last days.
on \g\ ;3
>>105907080The cartoon girl is squatting in a street in India. There is garbage all over the street.
accurate Microsoft hire:
What settings do you use for upscaling with chroma? Same as ultimateSDupscale?
get the fuck out of /g/ you dont belong here
you are faggots janitors you are faggots for letting this ai shit ruin /g/
>>105907525oh haii ;D
so just to confirm? we are non-fren now?
>>105907080kek
>>105906497i too always have trouble w\ the eyeballs...
even w\ teacache off it still bungles it sometimes
>>105907540>biggest tech since the internet is popular to talk aboutwooooow
>>105907554teacache method is shit compared to the lightx2 lora from the rentry, and slower
>>105907582something tells me you don't feel so confident in that
i understand your frustrations, but why post crappy sonic art
>>105907448i just use sdxl models for upscales bc chroma is so slow. I don't use USDU either, just straight up gen at full res (can even go to 4k) with tile controlnet
>>105907606do not ask speak to barneyfag
>>105907606Maybe he was one of those shitty deviantart sonic artists that used to get commissions for his shitty comic strips, and now nobody pays him anymore. very sad indeed.
>>105907582>not real artnobody cares how something is classified, im still making images and earning money from them
>ai will be dead in a few fucking years proof?
>>105907575im interested but
again, im a vramlet
& i dont want to break my existing workflows
if light can be implemented into comfyui
i perhaps can consider it - as i am interested;
what about all my existing wan lora? toast?
& my nsfw finetune? :c
>>105907703https://rentry.org/wan21kjguide#lightx2v-nag-huge-speed-increase
it's much much faster than default wan, so you can make more videos faster, even 720p is fast now. for vram, use multigpu node + virtual vram (add more if needed)
>>105907575I went back to teacache because I didn't like the motion in the lightx2v workflow. There was often this feeling the characters were moving in rewind.
>>105907430The cartoon girl is squatting in a street in India. A speech bubble beside the girl has the text "SAAR! DO NOT REDEEM!".
>>105907611using the ghiblishit makes you indian yourself
>>105907751seems fine at 0.8 to 1.0 strength, the ability to make 720p gens and not need 20 min is pretty amazing imo
>>105907721how much faster are we talkin?
it currently takes me 30-40 mins per render
if i upgrade to a 3090 oc maybe i can cook faster....
>>105907783Funny because I was given advice to lower it to 0.6 to help counter the frequent dimming. Any time I put in a brightly lit image, x2v would always dim it immediately and there was nothing I could do about it.
x2v worked fine with some images but teacache is more reliable and consistent in my case, even if the gen time is twice as long.
>postcard and rocketfag and kontext autist all emerge at the same time
jfc
Has anyone tried this?
https://openart.ai/workflows/whale_harmful_43/video-to-anime-consistent---wan-21-vace---long-length-low-vram/PuuwljtepF5sWPKJW5Wy
>>105907806>postcard and rocketfagliterally the same poster btw
how do i stop genning accidental futa on chroma oh god just end me
>>105907767??? where's the ghiblishit?
>>105907611How do I use that? Do you just put a resize node betweeen a load image node and the apply controlnet node?
>>105904307forge is not spaghetti
>>105907430>>105907753>the only style kontex is able to emulate other than flux slop is chatgpt piss kek
>>105907348Why did he? I thought they were friends?
>>105907941how do you get the water to look this good? i always thought all liquids look like shit in wan.
>>105907883"highres fix" workflow is included in the comfyui repo along with many others
>>105905482who is this semen demon
>>105907883workflow:
https://files.catbox.moe/s9qkzc.png
it's not perfect, has a tendency to change colors a bit (better than USDU does though, with no tiling artifacts) and I'd welcome any suggested improvements. it does a pretty good job and handles all art styles I've attempted, including anime.
genning at these resolutions requires more VRAM even if it's SDXL. I have a 7900 XTX so I have no issues with this. IDK if it'll run on 16GB.
recommended SDXL models for this workflow, the best depends on what you're upscaling and how you tweak the settings:
>pixelwave_sdxl11>bigLove_xl3>splashedMixDMD_v40>leosamsHelloworldXL_helloworldXL70
>>105907309>comfyui is just a way to string scripts together from people who have never used art software before. THAT is the fucking issue and why so many people complain.I'm unsure why people expected more from a guy who isn't an artist in the first place. Comfy is an autistic NEET respectfully, so of course his software will reflect that.
>>105907309>comfyui is just a way to string scripts together from people who have never used art software before. THAT is the fucking issue and why so many people complain.this is EXACTLY why it needs to die as soon as something else comes
>>105907309> krita has a tranny problemthe what?
>>105908187it's well known krita is trannysoft on /g/
https://github.com/hallarempt
>>105906943gib clean version
>>105907309>krita has a tranny problemunfortunate but how does that affect the software? most big projects have troons that strongarm their way into it because they are the ones that will be terminally online and autistic about it
>photoshop is a victim of corpo enshitification when was photoshop ever not spyware? just pirate and use in a sandbox, it always did the same job and you always needed more no matter what
>blender's licence sucks ass so not many people are willing to contribute because they can't reuse the code to make money implementing it into studio toolsstill the best software on the market for most use cases regarding 3d, fuck corporations and their own software
>comfyui is just a way to string scripts together from people who have never used art software before. THAT is the fucking issue and why so many people complainpeople misunderstand what the purpose of comfy is, the field is still young and there is no "correct" way to implement things and do things, new advancements, improvements, features and optimizations and coming out basically every other day, the only way you can keep up with those is if you have a modular workflow that allows every part of it to be easily changed (especially by the community through plugins)
you are supposed to use krita with it for better control and swarm ui for simpler ui if you want that, but comfyui is something to easily make and edit workflows, the actual fine grained image editing ui part that its missing should just be done in krita/swarm, implementing everything in one place is simply not possible given the work required to stay on top of everything in the field that changes every day