Discussion of Free and Open Source Text-to-Image/Video Models
Prev: >>105834473
https://rentry.org/ldg-lazy-getting-started-guide
>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>Chroma
Training: https://rentry.org/mvu52t46
>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg >>>/b/degen >>>/b/celeb+ai >>>/gif/vdg >>>/d/ddg >>>/e/edg >>>/h/hdg >>>/trash/slop >>>/vt/vtai >>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
did the last thread get auto saged?
Second attempt at a shodan lora is better I guess.
>>105836669
it did hit the bump limit
>VACE 14B is a multi-modal model by the makers of Wan, and it's designed for video to video and video editing. There's no /ldg/ workflow for it at the moment
but I want a /ldg/ workflow
it's neat how kontext can figure out fonts just based on the styles of a few letters:
change the text from "Deus Ex" to "LDG General".
Blessed thread of frenship
>>105836689
VACE has a ton of different use cases, though, and most of them would need different workflows. For instance, I've been using a workflow specifically for extending a video.
>>105836669it must have auto saged to about page 6ish, there's zero chance it fell off beyond page 10. It got removed somehow otherwise thread baker here would have linked to this thread.
>>105836743
>>105836669
You guys mean archived early, not autosaged.
fuck, forgot text
>>105834544
>>105834518
>>105834500
any WAN chads want to post some comparisons to chroma on the same prompt?
>>105836848
>>105834544
wan can generate images too?
>>105836669
It got nuked. I don't get why they'd nuke the whole thread instead of just deleting off-topic spam. Takes like 10-15 seconds to spot the nonsense posts. But we know the general where janny usually lurks, so it's no big surprise.
>>105836882buddy a video is a series of images
>>105836903What general is that?
>>105837004You don't know?
>>105836882
Yes, set frames to 1
>>105836669Based Dragon's Lair Enjoyer
is chroma still improving
>>105837070yes, it's the best furry porn generator there is!
>>105837070Some Anons in these threads disagree but I think it does from checkpoint to checkpoint.
I will continue trusting the plan, I'm still having a lot of fun with it.
You know what actually worries me the most is i have no way to avoid this apart from fleeing.
to people that re just arriving welcome to hell...
its not like i could just stand infront of people and say don't kill them! they will bash my head in with bricks and kill you anyway. few people understand England, its probably the most violent and vile population on the planet and you're walking around in it like you fucking own the place...
>>105837164Yeah I think it's improving
like these immigrants arriving here don't know we are on the brink of civil conflict and i have no way to stop that. you're walking right into absolute danger, the local chavs right can organize and they will fucking kill you, that is a fact its not a threat it just is how edge lords in my country operate. Anything north of Yorkshire is total insanity mode, because if this goes up no where will be safe.
>>105837196here lad >>>/int/212546228
i will run i will not fight because war is retarded but i just wanted to warn anyone in the uk that is not white while i still can make your plans now.
>>105837173nah I get it. all the good genetic stock fled that island and now it's some actual dystopian shithole like 1984. kek, fuck england. you fuckin suck
>>105837173>>105837196Are you ok? Are you posting these in the wrong thread or something?
>>105837210no i had to warn them because it might get real serious at any time and if that happens the anger in people which i can control a lot actually in my self won't be the same for your average townie shit head high on cider.
i hope i am wrong but they will and are looking to place blame and believe me the UK can be a scary place.
>>105837216nah i'm just fucked up mate i'll be ok but i had to go take my head for a shit coming off the weed is a bit nasty for the first day. so i wanted to express the real meaning of my rage and address it that it is felt in many UK men and we are some of most ruthless fighters the world ever saw its not a brag its a reality. an angry uk man would even make a Russian cry to his mom. and when that rage is direct towards people its devastating.
>>105837241Ok, well I hope you feel better dude. Put on a video game or something - take your mind off it.
>>105837241and its not the people at fault its the government and i want people to understand that. uk posters here understand its our fucking useless government not them.
because there are nice people here who are black a lot of them and they do not bother us they even goto church and some are even talking to us so it would be a real shame if people ruin that.
at the end of day i don't care about someones skin color its just not me i'm an ex military guy and do understand illegal immigration and how wrong that is but its not these peoples fault they have children now here, i will not stand by if i see my fellow country men brutalize them. for their colour or what ever.
i would be a total coward if i run now but i feel like because i do not wish to police anything i just want to live... But if i stay here that is what they will want me to do ex military after all autism or no austism i can still serve and they know that.
i will be called into arms the moment shit really goes down i won't be able to leave i will just have to do what they tell me and i'm good at that job i assure you of that which is why they will call me in. its due to my age they will need some 40 something to tell the bitches what to do because they are fucking dumb or something i thought long and hard about everything there, we old fags could walk circles around you its true. we was trained slightly differently back then and we are much cold in emotions and etc. we didn't have inclusion you do your job or you got jailed them kicked out if you refuse.
Wan 2.1 t2i is a funny ass model
tl;dr if i see you harm them i will shoot you dead no thought. i warn army stop or i fire enough times and if you don't or i thinking your actions are about to cause harm or death i will show you how fast 5.56mm moves through your chest and i will not feel one ounce of remorse.
because deep down i can't be this nazi you want me to be it won't happen it will be the opposite to the relief of people that came here.
so do not terrorize them because you will be met with deadly force.
>>105837338
is it still 2 minutes per gen?
>>105837350
53 seconds for 1920x1080 with a 3090
i'm on the side of the people always not some government or what ever always the people. I have to fulfill my oath in this case.
i will walk right up to that police line and tell them i'm ex forces and walk right in, just saying. don't expect this guy to follow your nazi shit he won't.
>>105837389Sdg tranny hellbent on getting another thread deleted I see.
to see the even bigger picture is horrible really. its unavoidable at this point and its probably by design we got a real hard time lined right up...
>>105837408i don't have to listen to you go take your fucking head for a shit... i could in RL snap you to fucking pieces.
come fucking sunderland pal and see how we live here i doubt you would even dare come anywhere near where i live pal.
>>105837363
it's the standard workflow but with only 1 frame generated, right?
ah fuck this good day i'm gone.
>>105837363
how the hell have I been sleeping on this model?
>>105837454
I installed a new portable ComfyUI exactly like https://rentry.org/wan21kjguide told me to, then I took the workflow this guy posted: https://www.reddit.com/r/StableDiffusion/comments/1lu7nxx/wan_21_txt2img_is_amazing/
>>105837471coz I'm the only one who used it and I only post weird shit
sorry
>>105837498Yes, you bastard!
All I'd seen of Wan as a single-image generator were your low quality milfs, which, while hilarious, didn't sell the model for image generation at all.
Then I see someone on reddit posting high quality Wan t2i stuff which looks better than most of current Chroma stuff, with good hands as well.
You played us, consequences will never be the same!
>>105837498
I find it funny that one anon posting wan image gens in this very thread is completely ignored, but the moment the idea hits reddit, everyone acts like it's a revelation.
>>105837603There was artsy stylized stuff, too.
>>105837616
Wan images aren't anything new in here. Generation just used to take so fucking long that it wasn't worth using.
>>105837616
it was there day 1, it didn't catch on because it's slow and hungry
>>105837603made the collage almost every time tho
whats with this guy schizo'ing out?
>Moonvalley Marey Realism
>Sweet a new, native to comfy video model that came outta no where that's on par if not better than wan!
>API
>Moonvalley Text to Video(5s) $1.50
>Moonvalley Image to Video(5s) $1.50
>Moonvalley Video to Video(5s) $2.25
>>105837498
I believe I asked you about it when you started doing it, but I never tested it myself.
Temu daftpunk.
>>105837650if this is the only way they make money the org is dead in two years
>>105837845more like skinny fat daftpunk. they really need to hit the gym to get rid of their boobs.
just learned about kontext
does it work for nudes and porn?
>>105838011yeah
you can remove clothes without inpainting
>>105837739>>105837675reminder, this is the type of person that claims they got 'bored' of AI gen'ing.
>>105838047nice, do you have any comparison pics?
>>105838113umm
I'm not posting nudes mate
try it yourself haha cheers xoxo
Trying to recreate a couple of gens i made with Chroma, exact same prompt (not very good, can be probably improved with better one)
>>105822965
>>105838251That chroma version is damn nice. I bet we'll see something like that during our lifetime
>>105837650It generates nice horse cock
https://video-editor-files-prod.s3.us-east-2.amazonaws.com/public/videos/385fdba5/original
>>105838216
>kick off a huge batch of gens
>come back an hour later
>forgot you had seed set to "fixed"
>>105838251prompt for pregnant robot waifu?
Redpill me on these:
>Chroma
>Flux Kontext
>LTX Video
Chroma as I understand is a flux finetune. I looked a bit but it doesn't seem particularly noteworthy or impressive to me? I see it getting posted a bit here that's why I am asking.
Flux Kontext: is this just a "remix"-oriented version of Flux, or a brand new model in terms of capability? I am assuming it takes similar resources to run as Flux, since it is a 12B model as well.
LTX as I understand is low quality compared to Wan and Hunyuan, but runs much faster than these. Some say it is worthless trash, some seem to consider it acceptable for low end workflows. What is /ldg/'s opinion?
Also I probably should confess that I am a 12GB Vramlet.
>>105838394Now you have many backups anon, feel safe.
>>105838395
A photo of a android woman, she is chrome colored and shiny, she is a pleasure model, she is wearing lingerie, artistic, she has a lit up glass container on her stomach with liquid inside which looks like a pregnant belly, dramatic lighting,
>>105838400
>Flux Kontext; is this just a "remix" oriented version of the flux or a brand new model in terms of capability?
It's the freemans version of OpenAI's "turn this man into a woman and make the image ghibli style" or "make him hold a sign that says 'KYS'"
>>105838400
>Chroma
Based on Flux Schnell. It adds actual CFG back (including negative prompts) and also fixes the censorship, plastic skin, flux chin, etc. It's a good model, but like all other successful models (SDXL, Flux), it will only show its potential with further loras and potential finetunes. We will see.
>Flux Kontext
Img2Img solution where you can easily change certain aspects without any inpainting/masking. It suffers from all the usual problems with Flux, though: generic faces, plastic skin, massive censorship.
>LTX Video
Never used it.
>>105838510
Thanks
>>105838478
>It's the freemans version of OpenAI's "turn this man into a woman and make the image ghibli style" or "make him hold a sign that says 'KYS'"
I guess I gotta compare how it holds up against sora now.
Is there a rentry or workflow to get started with kontext?
Oh I forgot to ask, for chroma what is the difference between "detail calibrated" and normal releases?
>>105838598
>The "detail-calibrated" epochs are recommended, as they are divergent versions of the model from when it began to train at a higher base resolution (1024x1024 vs 512x512).
My bad, I just had to read.
I wonder what is the technical reason for these two versions?
>>105838598
The 'main' Chroma branch is trained at 512x512 resolution (the last two epochs will see a bump to 1024x1024), but there is a second Chroma branch, which has been around for a short while, that is training at 1024x1024. IIRC the 'detail calibrated' release is basically a merge of the 'main' 512x512 branch and the 1024x1024 branch.
What is the place for loras now that civitai bans real people and flux kontext nsfw? DHT? A forum perhaps?
Can I use the required system prompt that NoobAI needs to alter it into a specific direction?
>>105838685
The technical reason is that no one knows shit about fuck, including how many high-resolution epochs are optimal, so now we have one branch with two high-resolution epochs and one with ten.
>>105838854
We're in the dark ages now.
Tensor.art is very much uncensored still, but searching for models there sucks.
Can I do this in comfy somehow?
>>105838934
The "Load Image from outputs" beta node might work, combined with any LLM custom node
>>105838934
pay for an llm node, then sure. the local llm nodes are all shit
I really like Wan t2i, but chroma still has more SOVL at the moment.
>>105838934
You essentially want an LMM to judge image quality and iteratively improve it? (Like those agent memes)
I don't think any local models can do that in a worthwhile manner yet.
>>105838993
No, I want an RP LLM to react to images of her actions and reactions, which then prompt another set of environments for the imagegen to gen, for her to move in and react to.
>>105839006
automated, or are you happy to do something after each iteration?
>>105838934
>>105839006
I forgor to add that in the middle of each black arrow is a processing mid-station where the user would alter the prompt ("issue orders")
>>105836977The artificial wife...
hi, where the fuck is radial attention?
>>105838992speaking as a chromasister who has never used wan, wan honestly looks like a far better base model, the furry should have just trained wan instead of mutilating flux. but as it stands, I am not seeing the same levels of sovl from base wan. what a shame
>>105838400
ltx video 0.9.7 distilled 13b is indeed fast, but the video output and adherence to prompts are terrible. It's also censored and won't show nude genitals.
>updated comfy
>the program icon has changed from a catgirl to some shitty C logo
holy sovlless
>>105839248
corpoization of comfy
>>105839029
Each gen would be like a game turn.
>>105839248
you have to use the legacy frontend. the nu-corposhit one is slow garbage
https://rentry.org/SP123-2018
I have found the Colombian Supreme Court ruling. They even allow CP deepfakes of real children
In compliance with the principles of legality and strict typicity, and without extending the law to aspects not previously contemplated by the legislature, the following forms of pornography are not punishable under Colombian law:
i) Technical child pornography, involving individuals who are not minors but appear to be so, either because they physically resemble minors or because technological tools are used to create that appearance;
ii) Pseudopornography, where real minorsโ images are inserted into pornographic scenes in which they did not actually participate (meaning they were not abused);
iii) Artificial child pornography, involving minors created from an unreal pattern, such as drawings or animations of any kindโi.e., they do not represent a real human being."
so host your loras in Colombia for the time being it seems. photoreal AI images/videos seems to be covered more by ii), while lolicon is covered by iii)
>>105839322actually i'm wrong, it seems that i) completely covers anything WAN could ever create for t2v, and ii) covers i2v
>>105839354Chroma v41-Few-Steps
>>105839248
have you tried genning a replacement? it's a 520x520 svg in the assets folder.
>>105839195
Asking the real question.
How do I run chroma? Where's the info for it in the OP? It's only training info in there.
>>105839469
https://files.catbox.moe/hlbgby.json
>>105839280
After first gen
Add a load image node
Right click the output -> send to workflow -> [current]
>>105839492
But I mean, what model do I use? What frontend? A long time ago someone linked me these, but idk why you're gatekeeping even the basics. It's not like there's some guide or FAQ I failed to read.
https://huggingface.co/lodestones/Chroma/tree/main?not-for-all-audiences=true
https://huggingface.co/silveroxides/Chroma-GGUF/tree/main/chroma-unlocked-v34
>>105839094>>105838953>>105838851here your (you) anime for your beautiful anime girls if it makes you feel better :)
>>105839532
You can get the vae and the t5xxl text encoder in the model downloader within comfy, I think. Pick a chroma version depending on what you can run.
>>105839553
Guess I have to bite the bullet and use comfy after all. Thanks.
When I follow tutorials on how to make loras for clothes, they all suggest to tag your concept as well as everything else in the image (the scenery, pose etc) but I had far better results when tagging only relating to the concept plus one or two variations (hands in pockets, no logo shown when viewed from the side) - so are you supposed to do the former or not?
>>105839581
Tag the stuff you want to be changeable and prune the tags you want to be unchanged/bundled with the activation tag.
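To make the "prune what you want bundled" advice concrete, here's a minimal sketch of a caption builder. The activation tag `myjacket` and the example tags are made up for illustration, and the comma-separated caption format is an assumption about how your trainer reads captions.

```python
def build_caption(tags, activation_tag, bundled):
    """Drop the tags that should be absorbed into the activation tag,
    keep everything that should stay promptable, and put the
    activation tag first."""
    bundled_set = {t.strip().lower() for t in bundled}
    kept = [
        t for t in tags
        if t.strip().lower() not in bundled_set and t != activation_tag
    ]
    return ", ".join([activation_tag] + kept)


if __name__ == "__main__":
    # Hypothetical example: the jacket itself is bundled, the pose and
    # scenery stay as normal tags so they remain changeable.
    print(build_caption(
        ["1girl", "letterman jacket", "pink jacket", "hands in pockets", "outdoors"],
        activation_tag="myjacket",
        bundled=["letterman jacket", "pink jacket"],
    ))
```

Anything left in the caption stays something you can change at prompt time. Anything you prune gets baked into the activation tag.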
>>105838854It's still CivitAI for anything other than celebrities and Kontext NSFW which the BFL faggots has put as against their license.
Here's a bunch of Flux celebrity loras, no idea what the quality is though: https://huggingface.co/malcolmrey/flux/tree/main
>>105838908
Tensor.art is 90% exclusive shit you can only run on the site, it's absolutely worthless.
>>105839532
Chroma is in forge
>>105839581
Both ways can work. The most important thing is to use correct tags. With simple clothing loras less is probably better.
>>105839657
Thanks. So for images where it's a side view, how does putting "from side" mix with the model's version of that tag? I just put from side as normal, right?
>>105839656
And the VAE would be the one they provided on huggingface?
>>105839675
what model are you using as base? you wanna use a similar tagging style to the base model
>>105839410'Cum Dodgers in the 25th century'
>>105839688
Illustrious. Was still using SD 1.5 until I found out I could actually run it on Comfy on my PC
>>105839677
it's the same fucking vae as flux. you can probably just dl it off of civit. they call it ae now because as usual researchers fucking suck at naming conventions
>>105839729
>it's the same fucking vae as flux
I mean yeah I know that now because I'm reading through the thread and I see that it's a flux finetune. Again, there's no information about it other than lurking and reading what people have said. All the image generals seem to have this problem but I can't be everywhere at once all the time. The OP could use even a basic update.
>>105839726
https://danbooru.donmai.us/wiki_pages/from_side explains it really well. Using https://github.com/jhc13/taggui/ and wd-eva02-large-tagger-v3 with a minimum probability of 0.35 usually gives correct tags, but I recommend removing tags like male focus, genderswap, parody, etc.
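If you end up post-processing tagger output yourself, the threshold-plus-blocklist idea could look something like this. It's only a sketch: it assumes you already have the tagger's scores as a tag -> probability dict (taggui does this filtering in its own UI, this isn't its API), and the sample scores are invented.

```python
def filter_tags(scored_tags, min_probability=0.35,
                blocklist=("male focus", "genderswap", "parody")):
    """Keep tags at or above the probability threshold, drop the
    blocklisted ones, and return them sorted by confidence."""
    kept = [
        (tag, prob) for tag, prob in scored_tags.items()
        if prob >= min_probability and tag not in blocklist
    ]
    kept.sort(key=lambda item: item[1], reverse=True)
    return [tag for tag, _ in kept]


if __name__ == "__main__":
    # Invented scores, purely for illustration.
    print(filter_tags({"1girl": 0.99, "from side": 0.90,
                       "parody": 0.80, "hat": 0.20}))
```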
>>105839750
No one's authored a guide (probably) because ComfyUI has a "default" workflow that I think is in its github examples section, the Forge implementation is kinda new, and the model itself isn't finished training. Once it's finished training I'm sure someone will write an actual guide.
>>105839750
>>105839917
>because ComfyUI has a "default" workflow
It's also apparently not even the correct implementation.
But yeah the model isn't even finished yet so of course there isn't a spoonfeeding guide
>>105839646
Yeah, that too. I managed to hijack some models using their comfy implementation + some hugging face nodes, but it's not a very successful method.
>>105839917
the comfy examples are shit every time unless you plan on genning fried cfg slop fennec girl, don't use them
>>105839997
>you plan on genning fried cfg slop fennec girl
You know me too well anon.
>>105838953slop face
slop eyes
>>105839094same
slop face
slop expression
>>105838908
tensor is censored as hell, it doesn't let me generate adult women in underwear.
I made some things, inspired from lovecraft's short stories. Original, I know.
What upscaler would you recommend for oil painting-style pictures? I can only do 1.4 latent upscale before OOM. I'd like a crisper result. Ideally, some kind of upscale that gives a brush strokes effect
>>105840118
Check ultimate upscale, it splits the image into tiles and you can do a 5x upscale if you want
Hello, have a newbie question.
I know about v-prediction, but does Forge auto-enable it, or do I need to flip a switch like I do with ReForge in the advanced sampler settings, or something like that?
So does anyone use Automatic1111 anymore?
>>105840167
Hory shet, it's great
If it does a "tiled" latent upscaling, why does it need an upscale model?
I gave it my general-use 4x ESRGAN but I have no idea what it really wants
>>105840473
You have to upscale the tiles before you resample them, and even if you resample, using an AI upscale model can potentially yield better results than non-AI algorithms like bilinear or lanczos. The difference is negligible, in my experience.
>>105840433
Forge or reForge now. They're forks of AUTO.
>>105840230
>>105840363
Use the latest full version, not the beta; they basically removed the repo gatekeep yesterday. It has much better artist knowledge even though it still knows surprisingly few.
So why is dual clip loader a thing for Flux and its derivatives? Why do I need both clip_l and t5xxl_fp16 ?
>>105840642more is better of course :^)
does VACE work with the light2x lora, for speed? or first/last frames?
Any advice on how to get it to run on an nvidia 5060? I'm using comfyui, and when I run my prompt I get a CUDA PyTorch error: "CUDA Error: No Kernel Image Is Available"
>>105840118
>>105840522
oops lmao I tried ultimate upscale on my already upscaled picture, it made 130 fucking tiles, and since I forgot to remove an old prompt, now there is a cave on every tile, and some other lovecraftian horrors here and there
>>105840733
That's impressive actually
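For anyone wondering how a tiled upscale balloons into that many tiles: the count is roughly the number of tile-sized cells needed to cover the upscaled image. A back-of-the-envelope sketch, assuming square 512px tiles and ignoring padding, overlap and seam-fix passes (the actual node may count differently):

```python
import math


def tile_grid(width, height, tile_width=512, tile_height=512):
    """Return (columns, rows, total) tiles needed to cover an image."""
    cols = math.ceil(width / tile_width)
    rows = math.ceil(height / tile_height)
    return cols, rows, cols * rows


def upscaled_tile_count(width, height, scale, tile=512):
    """Tiles needed after scaling the image by `scale` first."""
    return tile_grid(round(width * scale), round(height * scale), tile, tile)[2]


if __name__ == "__main__":
    print(tile_grid(5120, 5120))                # (10, 10, 100)
    print(upscaled_tile_count(1920, 1080, 2))   # 40
```

Since the same prompt is applied to every tile, a stale prompt gets repeated once per tile, so it pays to check the tile count (and the prompt) before kicking off a big upscale.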
>>105840714
50 series cards need CUDA 12.8 and the latest PyTorch version.
>>105840714
> how to get it to run
Get what to run?
>nvidia5060
8 or 16 gig version? Because it matters a lot.
> CUDA Error: No Kernel Image Is Available
Your install is borked.
Activate the virtual environment Comfy is installed in (source venv/bin/activate), then run a pip check.
>>105840794
Stable diffusion
5060 8gb and thanks!
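On top of `pip check`, it can help to see whether the installed PyTorch build actually ships kernels for your card. A minimal sketch, to be run inside the activated venv. `has_kernel_for` is a helper made up for this example; the `torch` calls are standard ones, but the exact-match check is a simplification (it ignores PTX forward compatibility, i.e. `compute_XX` entries), so treat a "False" as a strong hint, not proof.

```python
def has_kernel_for(capability, arch_list):
    """True if the build lists a native kernel target (sm_XY) matching
    the GPU's compute capability, given as a (major, minor) tuple."""
    major, minor = capability
    return f"sm_{major}{minor}" in arch_list


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed in this environment")
    else:
        print("PyTorch", torch.__version__, "built for CUDA", torch.version.cuda)
        if torch.cuda.is_available():
            cap = torch.cuda.get_device_capability(0)
            archs = torch.cuda.get_arch_list()
            print("GPU capability:", cap)
            print("Compiled architectures:", archs)
            print("Native kernel available:", has_kernel_for(cap, archs))
        else:
            print("CUDA is not available to this PyTorch build")
```

If the card's capability is missing from the compiled architecture list, that is consistent with the "no kernel image is available" error, and reinstalling a PyTorch build made for a newer CUDA version is the usual fix.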
>>105840681
ok, it does indeed work:
>>105840814
all I did was use the comfy template and swap it to the gguf multigpu node. and the lora works, 4 steps.
and here's one with vace + video reference
Miku Hatsune dancing:
could bump the framerate up but it works!
Survey time:
https://strawpoll.com/XOgOVDj1Gn3
>>105840880
this is with the light2x lora, the default before was causvid I think?
also it's funny how the lora is for t2v but works perfectly fine with i2v. is an i2v one being made, or is it even necessary?
>>105840885
vramlets RISE UP
>>105840433
Opened the a1111 bat file on the weekend and tested some gens with it. It still generates images, but it is unbearably slow and hungrier for VRAM than reForge on a 4060 Ti 16GB. I don't see a point in using it anymore, but I don't want to nuke it for nostalgic reasons. I believe sd1.4, sd2, 2.1, AnimateDiff and sd3 are compatible with a1111 but not with Forge/reForge, right?
>>105840433
Forge works significantly better and is basically the same thing
>>105840812
8gb sucks but SD(XL) can still be done there. I was able to run it fine on my previous 8gb GPU.
Again, your install is borked, update/verify/reinstall
>>105840885
3060 GANG RISE UP
>>105840885
Where's the option for multiple?
>>105840955
reforge is my go-to for all my noobAI (usually wainsfw v14) anime gens, it's fast and civitai helper is amazing for loras
for video/wan I use comfy, don't need a lora browser cause I only use a handful.
>>105841049
you should probably use this if comfy is exclusively for videos.
https://github.com/deepbeepmeep/Wan2GP
comfy is too bloated for just videos
>>105841066
is this only for vramlets or is it just a good solution for Wan in general? Comfy is genning pretty fast
>>105841066
I'm happy with comfy as it is now, the bookmarks and templates are a nice addition, plus my workflows for video are saved so I'm leaving it as is for now
what I like a lot about comfy is how it's modular, you can add/remove stuff how you like.
>>105841085
it has the same opts pretty much. could probably just use the same venv as forge. at the very least everything won't fuck up when you update, unlike comfy. I feel like I lose all the speed after a week and have to re-setup everything with the next humiliation ritual comfy wants us to go through
How much of the 6.5 gb SDXL model is the unet, how much of it is the text encoder? I believe the vae is around 319 mb, but the rest?
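One way to answer this for your own checkpoint instead of guessing is to read the safetensors header and sum tensor bytes per component. A minimal stdlib-only sketch: the header layout (8-byte little-endian length, then JSON with `data_offsets` per tensor) is the safetensors format, but the key prefixes below are an assumption about how single-file SDXL checkpoints name their tensors, so adjust them if your file differs.

```python
import json
import struct
from collections import defaultdict

# ASSUMPTION: prefixes commonly seen in single-file SDXL checkpoints.
PREFIXES = {
    "model.diffusion_model.": "unet",
    "conditioner.embedders.": "text_encoders",
    "first_stage_model.": "vae",
}


def read_header(path):
    """Parse the JSON header at the start of a .safetensors file."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(header_len))


def size_by_component(path):
    """Sum tensor sizes in bytes, grouped by key prefix."""
    totals = defaultdict(int)
    for name, info in read_header(path).items():
        if name == "__metadata__":
            continue
        begin, end = info["data_offsets"]
        group = next(
            (label for prefix, label in PREFIXES.items() if name.startswith(prefix)),
            "other",
        )
        totals[group] += end - begin
    return dict(totals)


if __name__ == "__main__":
    import sys
    for component, size in size_by_component(sys.argv[1]).items():
        print(f"{component}: {size / 2**30:.2f} GiB")
```

Usage would be something like `python sizes.py your_model.safetensors` (the script name is arbitrary). Only the header is read, so it's fast even on multi-GB files.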
>>105841109
If mine fucks up again (probably when Wan 2.2 and Radial Attention finally exist) I'll probably switch or at least play with it. So far it was hard enough to get it working how it is, and I got a good thing going here for the moment
What's the deal with token count for sdxl (specifically using vpred noob in comfy rn)? I was reading about some prompt stuff and saw people saying you should keep token count under 75. This whole time I've been going wayyy over. And I think most of the example pics I've seen around have been too. Is it worth trying to stay under? I feel like it gives you like no space at all... Or can you concatenate prompts?
>>105841360
I've spent a lot of time figuring this out recently. Of course, you can go over. The only problem is it'll automatically split your prompt into 75 token chunks.
The usefulness of ensuring different concepts sit inside discrete chunks is, in my experience, questionable, although possibly checkpoint-dependent. The biggest issue is rather the automatic chunk boundary being in the middle of a tag, because one tag (even one word) can be made up of multiple tokens. When that happens your tag can be misinterpreted or ineffective.
>Or can you concatenate prompts?
You can use the "Conditioning (Concat)" node (or Impact's "Concat Conditionings" for a slightly more compact way), there are also custom nodes that support using the BREAK keyword. It's really hacky and early stage but right now I'm testing a custom node that supports BREAK and also automatically inserts BREAK keywords where CLIP would normally silently split your prompt.
>>105841360
concatenating two clip nodes is worth it. Tokens at the beginning of the prompt have way more weight, and concat lets you control where your prompt 'restarts' rather than it ending up in some random spot.
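The "don't let a chunk boundary land in the middle of a tag" idea can be sketched as a greedy packer that starts a new chunk whenever the next whole tag wouldn't fit. This is not the custom node mentioned above, just an illustration. `rough_token_count` is a crude stand-in made up for this example (one token per word plus one for the comma); for real use you'd pass in a counter backed by the actual CLIP tokenizer.

```python
def rough_token_count(tag):
    # ASSUMPTION: placeholder for a real CLIP tokenizer. Real tokenizers
    # often split a single word into several tokens.
    return len(tag.split()) + 1


def pack_tags(tags, limit=75, count_tokens=rough_token_count):
    """Greedily pack whole tags into chunks of at most `limit` tokens."""
    chunks, current, used = [], [], 0
    for tag in tags:
        cost = count_tokens(tag)
        if cost > limit:
            raise ValueError(f"tag {tag!r} does not fit in one chunk")
        if current and used + cost > limit:
            chunks.append(current)
            current, used = [], 0
        current.append(tag)
        used += cost
    if current:
        chunks.append(current)
    return chunks


def to_prompt(chunks):
    """Join chunks with BREAK for frontends/nodes that support it."""
    return " BREAK ".join(", ".join(chunk) for chunk in chunks)


if __name__ == "__main__":
    # Tiny limit so the split is visible.
    print(to_prompt(pack_tags(["a b", "c", "d e f"], limit=5)))
    # a b, c BREAK d e f
```

In comfy without BREAK support, each chunk would instead go into its own CLIP Text Encode node, joined with Conditioning (Concat).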
>>105840429ComfyUI general
SDXL general
Get out
Which model do I use if I want to make images with this body shape?
p.s. doesn't have to be Asian and looking only to make images, not video.
>>105841430
>Tokens at the beginning of the prompt has way more weight,
This is true but if you notice a tag isn't effective enough you can also simply raise its weight. Of course, this can become very finicky when you start adding tags in the middle of your prompt and your chunk boundaries move, so manually managing chunk boundaries can also be useful in this way.
>>105841437
>ComfyUI general
nah, forge users have more interesting outputs most of the time
>SDXL general
nah, this is a chroma general
>get out
no u
>>105841449any model with the tag "1girl" should work
>>105841449try transvestitebugcreaturemix
>>105841449>>105841465To be more precise, I'm talking the thigh shape.
Most models I've used don't have that smooth curve of thick thighs without having exaggerated hips.
>>105836657
Do you know if there are any pirate websites or torrent sites where pirates download exclusive checkpoints and loras from Tensor Art and publish them? As happened in the 2000s and 2010s with Ares or video games?
>>105841066Look at the license lmao.
>>105841462Like /aicg/ has SillyTavern as their main UI, we need to define our UI that we can all contribute, otherwise it will be a mess.
>>105841521that would be anistudio whenever ani gives the go-ahead to jump in. I am so sick of jeet/trannyscript garbage uis
>>105841049Which branch do you use of ReForge?
>>105841520>>105841066It also has a bunch of telemetry from gradio and the author but those who use that software deserve it anyways.
>>105841420
>>105841430
>>105841452
I see, thank you :) Another thing I've been confused about is, for stuff like getting a pink letterman jacket, whether I should stick to raw booru tags (letterman jacket, pink jacket) or use some natural language to reduce the redundancy (pink letterman jacket). Or for objects: if I have 'holding water gun', should I also include 'water gun' as a separate tag?
>>105841552comfyui has telemetry from the custom node manager and API nodes so what's the difference?
>>105841549idk the current version, I did a git pull like 2 weeks ago, it works fine.
Which one do I use Forge or Reforge? Pros and Cons?
>>105841560
Natural language tags can work but sometimes they also don't. For example I've seen stuff like "pink hair ribbon" being interpreted as "pink hair, ribbon" so I tend to stick to danbooru tags if they exist.
>if I have 'holding water gun' should I also include 'water gun' as a separate tag?
I don't know. I tend to include "water gun" in this scenario but I've definitely noticed that it's not always necessary.
>>105841585Forge
Pros:
It's still alive (not really, but it is not straight up abandoned like reforge)
Cons:
Same cons as reforge, no video gen, missing some extensions and optimization compared to comfy, etc.
>>105841599now do pros/cons of comfyui
kontext edit, wan 2.1 to animate it
source: blizzcon red shirt guy
Give the man a black top hat. He is pointing a silver revolver to the left.
wan prompt: the man points his gun and fires it several times, the gun flashes with each shot.
>>105839248I don't like the C logo but I didn't like the fennec girl as an icon either. It was very unprofessional and would've made branding difficult in the future. Look at all professional logos. Almost none are anime characters.
>>105841599 So ComfyUI is the way. Besides the intricate UI and the huge memory leak, are there any cons?
>>105841657you somehow made image/vidgen just seem like the lamest fucking thing. go make a project already holy shit you boring motherfucker
>It's called ComfyUI
>It's not comfy at all
How can they cheat us like that?
>>105841667
>and the huge memory leak
what the fuck are you talking about
>>105841462
>nah, forge users have more interesting outputs most of the time
(You)
>>105841671it's an example of using an edit for a video, can you please go kill yourself instead of annoying people in this thread? you whine more than a bitch on her period.
>>105841667 no, comfy is going through serious bloat and is completely unstable compared to a year ago. heavy amounts of frontend enshittification too. there simply isn't any right answer for UI frontends right now because all of them are shit
720p Q8 gguf model, bit more time but higher quality:
>>105841686you've been doing "examples" for weeks at this fucking point. please fucking stop? nobody is getting anything new from your "contributions". you have no imagination with the slop you post.
>>105841701I am going to keep posting JUST because it makes you upset.
>>105841678 ComfyUI itself consumes LOTS of VRAM
>>105841706also, who are you to speak for the entire internet, you are one person: statistically irrelevant. I do not care that you, 1 of 8 billion people, dislike the post.
>>105841631Sure
Pros:
Maximum capability
Maximum options
Maximum customization
Maximum performance
Cons:
Incredibly autistic UI that will never stop being a humiliation ritual to use
Weird bugs like "VRAM lock" that needs a reset and other stupid shit that forces me to reset from time to time as well
>>105841667It sucks but it is still the way if you are serious
>>105841687>>105841735 I would like to add the poisoned well of useless custom nodes in community workflows
Is it just me or does ComfyUI freeze the PC every few WAN gens?
Isn't there a Forge-style ComfyUI fork that's less bloated?
>>105841774 ComfyUI is super buggy, every now and then I have to watch youtube ads to keep genning
>>105841774it's turning into civitai levels of bad
kontext to add Trump to the Epstein cell block
then wan: man with a knife opens a door on the right and walks inside.
first try, became a house door on the inside kek
>>105841774Yep, not you.
It locks the PC for about a second, up to a few times, while using heavier models like Flux and video gen.
okay. this one is much better:
>>105841714Standalone runs like shit, use portable variant.
>>105841862electron still runs like shit and has telemetry
so is comfyorg just wrapper chinks and comfy just implements a model every now and then?
>>105841797that trafficking chomo walked right out of there without a scratch
big bill passes, case dropped, diddy free
cool game.
>>105841674kek
there are other browser web interfaces for normie-types
>>105838851
>heterochromia
my fav thing to prompt ! cute, CUUUTE!
>>105838085niiice doggy, stay down doggy ;o
kontext to put lord Todd on a throne with a title, then:
A man sitting on a throne stands up and holds his arms out, with a smug expression.
>>105841774Nah, never happened to me. Might be running out of ram and causing it to slow down.
>>105841885>ywn encounter team rocket erika as a young trainer in a forest at night, before her pokemon uses sleep powder on you
>>105841881yeah, pretty much. not much of a future
>>105841920Do you guys know why ComfyUI eats so much system RAM when running WAN?
Shouldn't the models be loaded only on my gpu's VRAM?
I have 64GB of RAM and I see python3 alone eating 52GB.
>>105841924
>multi-timeline theory being canon means it COULD maybe happen someday ;_;
https://www.youtube.com/watch?v=ayMze2qoKvc
>>105841986 52gb of snake oils
>>105841986 yes, most people don't have the vram to run wan. if you run out of vram it will use system ram, and if you're short on system ram too, your entire computer will slow down
doesn't matter what ui you use, you will have the same problem.
>>105841986that's normal if you're genning long or high resolution videos
>>105842006I don't have this issue with wan2gp
A man rides his bike off a ramp, launching it high into the sky.
>>105842026if you are just doing /v/ shit just go post in the /v/ thread
>>105841986If you don't have enough vram it will offload currently unused parts of the model to system ram and then swap it in again when needed while offloading something else that's not needed.
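The swapping described above can be pictured with a toy model. This is a sketch under stated assumptions, not how ComfyUI or any real backend manages memory: `ToyOffloader`, the block names, and the sizes in GB are all made up for illustration, and eviction here is plain least-recently-used.

```python
from collections import OrderedDict


class ToyOffloader:
    """Toy model of VRAM/system-RAM swapping (hypothetical, illustration only)."""

    def __init__(self, vram_budget_gb):
        self.vram_budget_gb = vram_budget_gb
        self.in_vram = OrderedDict()  # name -> size in GB, least recently used first
        self.in_ram = {}              # name -> size in GB, parked in system RAM

    def register(self, name, size_gb):
        """New blocks start out parked in system RAM."""
        self.in_ram[name] = size_gb

    def use(self, name):
        """Make a block resident in VRAM, evicting older blocks if needed."""
        if name in self.in_vram:
            self.in_vram.move_to_end(name)  # mark as most recently used
            return
        size = self.in_ram.pop(name)
        # Offload least recently used blocks until the new one fits.
        # A block bigger than the whole budget is still loaded after
        # everything else is evicted; a real backend would have to split it.
        while self.in_vram and sum(self.in_vram.values()) + size > self.vram_budget_gb:
            evicted, evicted_size = self.in_vram.popitem(last=False)
            self.in_ram[evicted] = evicted_size
        self.in_vram[name] = size


# Made-up sizes: the point is only that the total exceeds the VRAM budget.
offloader = ToyOffloader(vram_budget_gb=16)
offloader.register("text_encoder", 10)
offloader.register("diffusion_model", 14)
offloader.register("vae", 1)

for block in ["text_encoder", "diffusion_model", "vae"]:
    offloader.use(block)
    print(block, "-> VRAM:", list(offloader.in_vram), "| RAM:", sorted(offloader.in_ram))
```

Whatever is not in VRAM at a given step sits in system RAM, which is why a machine with little VRAM shows large RAM usage during video gens regardless of the UI.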
>>105841952>>105842026WHO EVEN IS THIS UGLY NON-FAMOUS RETARD
>>105842029he's just going to sperg then samefag again. I gave up trying
>>105842042Todd Howard, creator of Skyrim
>>105842029you are posting nothing and only complaining, you are worse than a woman.
what is the point of wan2.1_flf2v model if wan2.1_vace can do First-Last Frame stuff?
>>105842023nobody is using that shit anon stop shilling it
>>105840642>>105840658I tested a bit and it does seem to provide better results on average.
I wonder if there is a non-meme answer as to why?
>>105842063flf specializes in interpolation between two frames while vace is more general.
>>105842075???
It performs that way because it was trained that way, it doesn't mean that was the right way to train it or that the cost of loading it and using it is worth it. Clip gives a signal to the model to maybe help performance, purely speculative, and likely as a form of censorship because Clip can lie.
https://blog.novelai.net/novelai-diffusion-v2-weights-release-b9d5fef5b9a4
NovelAI released their v2 model, can anyone contact NoobAI and tell them?
>>105842097I think I misunderstand something.
Isn't CLIP_L just another text encoder like t5xxl?
>>105837872you can't come from nowhere with a paid tool, lmao. even big companies tried to be free, in the past. except shitjourney
>>105842006Wan doesn't take 52gb of VRAM
>>105842037 what about additional video loras? can you have it do that too? i want to experiment with multiple but i OOM and they are not loaded/used
>>105842154it honestly did more reputation damage since comfy said in the threads he would never let comfy be a wrapper for apis
>>105842120 It's a version of CLIP like SDXL uses. t5xxl is more like an LLM, but it's just the text encoding/decoding part without the functional logic of an LLM. It more or less turns words into numbers that are useful for mapping to latents, but it has no training on images. CLIP is a text encoder that also knows what images are of. You might see where this can be a problem for certain images.
>>105842159torch compile set to use a bunch of cache does
You gain speed by disabling the torch compile node in wan
>>105842214 wait, you've got me mixed up. anon didn't say wan itself used 52gb of vram, he was talking about ram, and that large amount of system ram usage comes from torch compile, not wan
>>105842194I suppose I should try a more NSFW version of this test later then.
And also try a separate prompting for each encoder, perhaps giving clip_l broader tags and t5 naughtier details.
>>105840885ayymdbros.... not like this
Are there nodes that warp, twist and distort gens? Similar to photoshop warping parameters where you can kinda control it?
>>105842280 Hopefully AMD can become a viable alternative to Nvidia for AI in a year or so, right now they are sadly practically unusable.
>>105842280 I have a 6950xt. It's better in some ways than a lot of nvidia cards, just because the vram is 16gb, which isn't 16gb nvidia level, because of the translation layers. But still, it's more than 12gb.
>>105842106should i be excited?
>>105842309i genned this with a 7900 xtx, amd is far from unusable. just don't be a Windows zogslave.
>>105842369
>amd
>i genned these two gay dudes
nice bait m8
>>105842386ok, first of all how do you know they're dudes? Did you just ASSUME? that makes you the ass-gay. You're gay. You're the real homo thinking the homo thoughts here. GAY!@ GAY GAYG AYG AYGAYGAY
>>105840642CLIP was the original way to make sense of the captioning of those monster datasets from Laion, so it still has a hand in translating some of that stuff for T5... at least that's how I remember it being explained.
>>105842211Very clean, guns and fingers used to be borderline impossible. Would you be willing to catbox? Curious what types of guns and poses this can handle
>>105842421the ride never stops
the summer never ends
the glowposting is ceaseless
the schizo-posting accelerates
nothing ever changes in these days
in these final unknown times & days
truly we are in the last of days, its almost over..
. . . T H E . E N D . . I S . . . N E A R . . .
>>105842443When will Kromatext 1.0 be released?
New SLG implementation is finally live.
Verdict?
https://github.com/comfyanonymous/ComfyUI/pull/8759
>>105842299no but i imagine it'd be easy to make one.
new idea for a wan video, a cock with a tit. That should be maximum nsfw.
>>105842528memory holed but this is something I didn't really need, it will probably be the same level of gacha
>>105842538thats 90% of >>>/gif/vdg
>>105842528i didn't care for it when i was trying it out earlier this morning. kept getting weird glitches but i only tried a few different settings on it.
>>105840885
>91 total votes
woaw
>>105842528i doubt anyone will notice the difference, but a step in the right direction regardless.
>>105840885funny how most have 3060. that must be pure suffering.
I hate comfyUI so much goddamn
>>105842576As someone who cannot vote due to VPN, I conclude that around 150-250 people visit this general throughout a day
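The estimate above amounts to dividing the vote count by a guessed turnout. A minimal sketch of that arithmetic, assuming (hypothetically, since the post gives no rates) that somewhere between 36% and 60% of daily visitors actually voted. `estimate_visitors` is an illustrative helper, and only the 91 votes come from the thread.

```python
def estimate_visitors(votes, turnout):
    """Estimate visitors from poll votes, given the fraction of visitors who voted."""
    if not 0 < turnout <= 1:
        raise ValueError("turnout must be in (0, 1]")
    return votes / turnout


VOTES = 91  # from the poll mentioned in the thread

# Turnout bounds are assumptions, chosen only to show how the range is derived.
low = estimate_visitors(VOTES, turnout=0.60)
high = estimate_visitors(VOTES, turnout=0.36)
print(f"roughly {low:.0f} to {high:.0f} visitors per day")
```

The range is only as good as the turnout guess: people who cannot vote (for example behind a VPN) push the real turnout down and the visitor estimate up.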
>>105842540This is quite nice