Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>105916388https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows/home
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>ChromaTraining: https://rentry.org/mvu52t46
>Illustrious1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/celeb+ai>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Any news on the video gen front?
where do I start for local 3d mesh generation? /3d/ has a thread but it's years old and just laughing at the bad quality of the early stuff. I've seen some very good mesh generation models recently but I'm not sure if any of it can be run on consumer hardware.
>>105921149Not really, everyone waiting for Wan 2.2
>projectile vomit and Rule34 of the flood from Halo made it into the college
based
>>105921161Any worthwhile improvement expected besides optimization for vramlets?
>>105920969a fellow rinne lagrange enjoyer i see.
>>105921138 (OP)>>WanX (video)>Guide: https://rentry.org/wan21kjguide>Edit: 25 Jun 2025 02:55 UTCHas there really been no updates since?
full
md5: c631b72bc452da5649ef898bdeb76bfa
๐
Hey Faggots,
My name is John, and I hate every single one of you. All of you are fat, retarded, no-lifes who spend every second of their day looking at stupid ass pictures. You are everything bad in the world. Honestly, have any of you ever gotten any pussy? I mean, I guess it's fun making fun of people because of your own insecurities, but you all take to a whole new level. This is even worse than jerking off to pictures on facebook.
Don't be a stranger. Just hit me with your best shot. I'm pretty much perfect. I was captain of the football team, and starter on my basketball team. What sports do you play, other than "jack off to naked drawn Japanese people"? I also get straight A's, and have a banging hot girlfriend (She just blew me; Shit was SO cash). You are all faggots who should just kill yourselves. Thanks for listening.
Pic Related: It's me and my bitch
>>105921189Nobody knows, people speculate that it will be trained on a longer duration, but again, could be anything
>new lightx2v i2v wan lora dropped
>my 5090 can now gen 10sec 720p coom vids without motion loss
my dick thanks you based chinamen!
>>105921201Thanks John, your bitch is hot I'm gonna print out her face and skeet on it.
>>105921201why this dude posting 20 yr old memes
>>105921201ChatGPT, what did he mean by this?
i need a new character to put in the prompt desu
>>105921207>gen 10secGen 10 second long videos, or gen videos in 10 seconds?
If it's the first thing (gen 10 sec long videos), please elaborate, preferably with a catbox
chroma bringing the kinosovl
>>105921279holy kek, excellent gen
>>105921149Some happenings
>pusa wan ggufs (might still be locked to 5 secs)>radial attention has been updated to support sageattention1>lightx2v brought out a i2v lora, seems to fix some saturation and movement issues https://huggingface.co/lightx2v>ALG for better movement but no comfy mention https://github.com/choi403/ALG
>>10592120710 sec length gens? Does it not slop out and loose all of the fine details? I havnt tried the new one yet but on vace, going beyond 6 secs it looses all fidelity
>>105921279Finally
>>105920040Hey I was using prompts like this
Heavy industry rises in the northern wilderness โ cranes lifting beams, smokestacks trailing clean exhaust into a bright sky, and power lines stretching like veins across the land. A female welder and a male engineer dominate the foreground, seen from a low angle to exaggerate stature. Behind them, oil drums and construction vehicles create a strong horizontal base. Use cool steel tones, clean whites, forest greens, and accenting reds for impact. Propaganda-style speed lines emphasize vertical growth and national purpose. Composition should be layered, bold, and monumental.
Iโll be honest I have no idea what im doing but stuff like this seemed to work
I have other ones once I get home I can maybe find some. I wasnโt using any loras just straight chroma with like 50 steps
How does the fixed lightx2v lora compare to the teacache workflow? Still a noticeable loss in quality?
>>105921411noice
s a v e d.
>>105921138 (OP)blessed thread of frenzone :3
should I use fp8 or iq8 for flux and wan, and why?
>>105921470>iq8Because you can never have too much iq
>GPU had been under load all day from generating videos
>95% of that work was for cancelled gens
>>105921470q8 used to be better than fp8, but now you typically see fp8-scaled offered which is just as good if not better than q8
>>105921296He aint coming back, hes gooned to close to the slop
>>10592129610 sec long videos using kijai wanvideo workflow (used to be in the guide) in ~7 minutes @ 4 steps or ~11 minutes @ 8 steps. I've got 64gb of ram, so it may not work if you've got less.
https://files.catbox.moe/gm1d26.json
>>105921345Yes 10 seconds length, and you're thinking of FusionX that does that, this new lightx2v lora is for i2v (though 480p but still works for 720p res vids, the 720p lora version could be next as the huggin page is up for it).
>>105921138 (OP)Do I get an used 3090 (in my country crypto mining isn't really a thing and used cards mostly come from gamers updating) or a new 5060 Ti with 16 GB? Both at about the same price. The 5060 Ti maybe a bit cheaper, but not by much.
>>105921510but I should use iq8 because it's faster than fp8?
kj
md5: fdfb972894c78bc85594de2b08ca8493
๐
https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/804#issuecomment-3075380477'
kijai is confused by the pusa release
>>105921541>iq8I assume you mean q8 ? And no, they're not faster than fp8, actually they are slower, but people used them because the quality was better, again however, fp8-scaled models are just as good quality-wise.
>chinese enter the FOSS scene
>everything becomes low quality, cut-corners garbage
you asked for this
>>105921565oh it's the chinese who are to blame for indians and jews cutting corners on all western software, now I truly see
>>105921578god I wish it was north korea leading the charge. USA would be in shambles
HOLY FUCK, new light2x distill is fucking amazing, it actually works for image to video now
https://civitai.com/models/1585622
https://files.catbox.moe/lt2mmw.mp4
Took fucking 40 secounds
>>105921562He should take over comfy ui and rename it kjui
>>105921591show a comparison without
>>105921596that would take like 30 mins
>prompt "blonde hair, blue hair ribbon"
>ai gives me blue hair
how can I avoid this shit?
>>105921565>chinese enter the FOSS scene>jewish schemes to make good models paywalled and API-only are thwarted, also they become feasible on consumer grade hardwareGee anon, I really wonder which side we should be on...
>>105921565jewish hands wrote this
>>105921613it used to be slavs that were bored carrying everything. even japanese were more relavent back then. now everything is chinky-winky cardboard quality garbo and it just dosn't match the saas quality. the jews are laughing at them
>hassakuXLIllustrious_v30
It seems pretty great
>>105921611>"blue ribbon on hair">blue hair in neg prompt>increase cfg
509
md5: 3e4bb6e0b2e8f48d6d62915fd1e5faca
๐
Anyone tried the double clip on Illus? I have both G and L but outputs are fucked
>>105921631Go back to /sdg/ and shill there
>>105921653what is he shilling? truth?
>>105921631>>105921656Samefag somewhere else
>>105921530uh whichever has more vram
>thread chinaman is getting aggressive
>>105921520Nope, not thinking of FusionX, I dont use that shit, I've been testing these non stop since they dropped. The original t2v lightx2v lora completely removes all the fine detail and almost "smoothes" out when you're around the 10 sec mark but you begin to see degradation after 5 secs. Then again, you're using kijai's workflows? Suppose its different as I'm using native gguf.
If you're still skeptical, create a super detailed person, load a i2v vace workflow with reference video and try it for your self. Anyway, gonna try the new one shortly.
>>105921549Oh man, you know its not looking good for pusa if he's saying this...
>>105921679When where you when pusa was pos
>>105921631If everything went according to (((their))) plan, you wouldn't even have most of the models people are using in this very thread, anon. Especially powerful video models like Wan. Yes, all of the models people use here are behind SaaS, but you better believe it they are good enough to make some cloud kikes pissed, they definitely lost some customer base even if minimal
>>105921679try the new V2 ones that just dropped today, much bigger datasets
>>105921591
>>105921695you do know chinese people are basically the jews of asia right? they both evil af
>>105921631and where did the slavs go? oh, they're busy killing each other so Jewlensky can sell depopulated Ukraine to israel and create Khazaria 2.0.
>>105921613>>105921620hilarious how everyone instantly knew who was behind that post. i'm not even right wing, just tired of jewish lies, crimes and terrorism.
>>105921695Wan truly makes the jew cry in rage, no shekels from SAAS, no social engineering control of what you are allowed to generate, truly anudda shoah
I could sell this to some gallery ngl.
>>105921711Pattern recognition is a powerful thing
>>105921591Was testing it a bit earlier and it's pretty good. I feel like it's actually faster motion wise than regular I2V but I haven't done a comparison yet.
>>105921785I think they trained it at a higher FPS but same, I have not tested it enough yet. It is night and day better, its legit 99% of the way there for like 1/20th the gen time, old ones degraded motion / prompt following a ton
>>105921712I don't use local models only. My workflow is a hybrid between chatgpt and grok and flux, because sometimes they simply give you better initial generations or photos for style transfer than local models.
>>105921153sparc3d is recent, not local however, the hugging face space if free but has a long wait time AND you won't get textures. The website has texture generation using a weaker model I think, but you can only make 5 free models.
Tripo3D is worse in detail (especially anime faces, sparc3d isn't much better) but it supports PBR generation so you can have a glossy model OR add PBR to a model that doesn't have it, and it has a bunch of quality of life features (like rigging, retopology, but it's not perfect), overall pretty fun but not that good.
I use firefox with multi-account containers to switch between accounts.
The closest thing for local models is I think that you could make a mesh from IRL object by taking pictures around the object.
>>105920594This makes way better coughdiaperscough than WAI even without any lora guidance so I might finally throw WAI into the dumpster.
>>105921851>coughdiaperscoughyou're disgusting
>>105921785Wan knows how silicone moves
>>105921849Mega pint of mega milk
>>105921851noob is the most advanced in that area
Is there a node for ComfyUI node that logs the positive & negative prompts and the noise_seed to a text file for every successful gen? I am constantly canceling video gens and editing my prompts and by the time I finally get a good one, the exact prompt for the current seed often gets lost unless I ctrl-Z back to the correct point. It would be extremely useful to have a log file that connects prompts to seeds (which I can identify by filename).
>>105921520b-b-b-b-but anon said you can't gen 10 second videos. i guess he was FUCKIN WRONG, BITCH
Is there a ComfyUI node that logs the positive & negative prompts and the noise_seed to a text file for every successful gen? I am constantly canceling video gens and editing my prompts and by the time I finally get a good one, the exact prompt for the current seed often gets lost unless I ctrl-Z back to the correct point. It would be extremely useful to have a log file that connects prompts to seeds (which I can identify by filename).
(third time's the charm)
>>105921836thanks anon, sparc3d is great. i know my way around 3d so i don't mind the cleanup or texturing.
>>105921922What's with the weird interlacing ?
>>105921934The website for sparc3d is called hitem3D by the way.
And this video was kind of insightful.
https://www.youtube.com/watch?v=jfk8e4ykp-s
>>105921966it was compressed to hell
https://litter.catbox.moe/4nvlpu1gz79zgld3.mp4
>>105921520>>105921591>>105921705Ok, I see what you guys are saying now. Just tried the i2v lora, its actually pretty damn good. Oddly enough it handles 10 sec gens quite well. Wonder how far we can push it..
Even with 1cfg NAG, the movements have vastly improved (I dont have to go to 1.5, 2 cfg kek) Based Chinaman strikes again!
>>105921922Now try to do something a little more complex than a whore dancing for 10 seconds
I swear, every single time video extension example I see are fucking whores dancing or shaking their boobs/ass, never something that truly showcases the extended time like a scene with lots of things happening, like "subject shows up, subject uses X to do Y, then Z happens in the end"
>>105921975Ok, now that's much better!
>>105921995that doesn't work well at 5 secs either so i don't even know what you're talking about.
any good ash/satoshi loras that capture the aesthetic of the anime?
Does this look Mucha enough? It knows the Art Nouveau tag, but Mucha alone didn't do anything.
This is NOT the Alphonse I requested.
>>105922012Then it's useless. Video extension will only ever be useful if you can fit multiple actions within a timeframe, at very least allowing to do something like "in the fist 3 seconds, this and this will happen, then for 5 seconds, this other thing will happen" and so on
With that said I fucking pray that Alibaba increases the time baseline for Wan2.2 to at least something like 8 seconds instead of 5
>>105921995>>105922156It's not supposed to be some sort of a video extension (that's just you getting the wrong idea). It's a self-forcing lora for WAN 2.1 meant to generate videos with significantly fewer inference steps (4 steps) and without classifier-free guidance, substantially reducing video generation time. That's all it is.
Like you said, we can only hope that WAN 2.2 brings further improvements.
I was a VRAMlet who tried comfyui on windows ages ago. Now I am a VRAMlet with 16GB and an RDNA4 GPU.
ComfyUI is Linux only for AMD right?
Any windows tools that allow for chroma, flux or lora stuff or am I doomed to go down the path of installing lubuntu and amdgpu?
I figure I should check in before consigning myself to hell tomorrow dealing with the amdgpu install.
>>105922308???
I am not talking about self-forcing/lightx2v, I am talking about the fact that those anons are making 10 second long videos
>>105922369bad gen, censored model
>>105922388Censor bar added in Premiere Pro.
The diffusion model is uncensored, I can tell you that for 100%.
>>105922399how's the futa situation?
>>105922414u might wanna delete tits
fug
md5: e107c392a397268742028552b185d071
๐
I may have messeded it up a bit
If I set hand detailer denoise high enough, can it morph six fingers into a normal hands or is it just gonna turn it into a flesh horror?
>>105922453I'd rather take the ban.
>can't recolor t-shirt because the Blue Archive tag bleeds the color blue onto it
Is this what they call 1st world problems?
>>105921728Such delusions
>>105921611delete system32 folder
>>105922503basedo
Does NoobAI need wall-of-text prompting? Everything looks like ass if I just use booru tags.
Thinking about getting back into this, been having a quick read over things. I have not proompted since before reforge was a thing, but I am seeing dev has stopped. Any point in getting into it now or I guess I dunno where to start first.
I only really used a1111 before. I tried installing comfy a few weeks back but for w/e reason I couldn't get it to recognize where I would keep checkpoints on my computer and got fed up with it.
>>105922760>I couldn't get it to recognize where I would keep checkpoints on my computer and got fed up with it.skip the pain and give it another year or two
>>105922760>I tried installing comfy a few weeks back but for w/e reason I couldn't get it to recognize where I would keep checkpoints on my computer and got fed up with it.just keep them in the designated folders?
Even I could get comfy to work
>>105921785Can my 2070s do this and do I need comfy
>>105922833I could get it to work, but after 2 reinstalls I just could not get it to recognize other folders outside its default install.
>>105922760sdxl checkpoints work great with ReForge, for videos and new stuff Comfy is must. I suggest you keep both installed.
912
md5: e945fe0493662c45838d6aaa034e3054
๐
>>105922760You can get around it by inserting the "other UI" section and it will work. Just (un)quote or add categories you need.
>>105922860Okay I'm gonna go 1 by 1 to ease myself back in
first forge then reforge then comfy
I am starting to suspect that something else was done to the new lightx2v that made it have better prompt alignment than even base Wan.
Base Wan only followed prompts up to a point or made some weird transitions, but with this distill it's noticeable that Wan now tries its best to align to the prompt.
They used RL, correct?
Playing with gpu weights was not something I had to do before, I have no idea how high or how low I should be going.
Tips for prompting with xl? heres an example of a prompt I pulled from an old png
https://files.catbox.moe/2qaykz.png
Do I need all those weights and brackets anymore? I'll admit it was probably too much back then
Also since I was pre XL, I'm not sure if I am meant to be using hires fix with it? Because watching the gen its already doing that unless im mistaken?
>jobs that wont be replace by AI
>>105923054>mfw ai will never replace ai artists
>>105923054Yeah, overall blue collar jobs will be the last to be replaced, robots are really expensive.
Meanwhile for jobs that are purely digital, it's going to be fast.
the new i2v lora is amazingly good quality for 4 steps, and doesn't seem to have problems with ignoring the prompt the way other turbo loras for WAN have
>>105921559why is int8 slower than fp8? isn't integer computation generally faster because you don't need to handle exponents?
>>105923132>inject starting prompt into a T2I>LLM analyses output with I2T>LLM puts the prompt into T2I>chain infinitely
>>105923192Where do you find these image model iq8 quantizations ?
testing the i2v lora on an old portal gen
>>105923334>An anime girl steps out of the portal
>just started playing with KontextCan I control the image dimensions or the scaling, if not, what resolution is it expecting so I can preprocess images with a better scaler.
I'm currently using the basic Comfy workflow with NAG swapped in.
>>105921189People only want length once the get used to Wan... the sluts! So that's likely the priority.
e
md5: bcc4847832b0b4f69eb3b43cfec3acc6
๐
> new lightx2v lora
> it doesn't work
amazing, local keeps winning
>>105923380no prompt (box left blank)
>>105923438https://huggingface.co/Kijai/WanVideo_comfy/tree/main
Do you mean ACTUALLY the quant being called IQ8? Because I've only seen that on LLMs. Q8 is what you normally want.
>>105923476I thought you were the guy talking about iq8 earlier in the thread, and I've never seen any such file ever.
q8 is old hat, what does q8 have to do with int8 ?
>>105923476> I've only seen that on LLMsbro
>>105923504Intredasting, I've never seen an image model with iq4, what model is this ?
>>105923512want to run it on a toaster? anistudio got you covered
>>105923523How is there even anything left in that?
>>105923523>anistudiois the slowest inference engine tho
>>105923522pretty much whatever you want but it's ggml only. vpred sdxl isn't in yet but that doesn't really need to be quanted anyways
>>105923534all the Q and K quants as well plus some more meme ones
>>105923540at least you are allowed to use the lib in a game. I've been away for the weekend but it's time to keep going
>>105923548No, I mean how is there anything left of the capability of the model on a Q1 or Q2
>>105923523Hahahah go fuck yourself
file
md5: 637ded83749a4040834bc969a8b59a83
๐
>>105923570never tried but it's taking too long and I need to sleep
>>105923285I use KJNodes. It's way better than nothing.
>>105921591>pixelated trashvramletbros...
>>105922414is this what the ytboi colonizers had in africa??? shieeeeeet
you DO gen overnight to go through it in the morning, right anon?
>>105921785Jesus Christ. Did you use a bouncing lora or just base model for that motion with a specific prompt? Also what model used for the t2i?
wow this is magical. fed this old gen of a weird creature into wan
>>105923962and got this, it did a beautiful job
I wish someone made an artist showcase website for noobAI, we already have 3 or 4 for illust, but no love for noobai
Is there a noticeable difference between using open-clip-xlm-roberta-large-vit-huge-14 vs clip_vision_h packaged by comfyanonymous ?
>>105923974This one uses noob: https://www.downloadmost.com/NoobAI-XL/danbooru-artist/
>>105921591what is light2x distill?
any noobai comfyui workflow with adetailer?
is there an open source or even closed source model that does animation extraction from video in suboptimal conditions really well? video related, but imagine them rolling around and limbs intersecting, so even worse conditions for the biped skeleton detection algorithm. The best I found so far is viggle. I guess they use metas segment anything for human detection, then mask the detected human and apply the skeleton detection algorithm. It works pretty good as long the person can be segmented, which wouldnt be possible for the pinned girl in the video. So I would much prefer something that doesnt mask at all (or only subtract mask interfering elements) and then goes straight to joint detection and tracking, even if the joint rotation is all fucked up. I'm sure there's another algorithm to fix that.
>>105924123The only difference with other workflows are:
- you'll have to use "by artistname" instead of "artistname"
- use the recommended quality tags
- your usual sampler might not work or perform well; use euler
- your usual scheduler might not work without custom sigmas, use normal
>>105924186the workflow at https://civitai.com/models/833294/noobai-xl-nai-xl
is very basic, i want some good second pass highres fix and adetailer workflow, anything good ones like that around?
>>105924029Thanks, didn't know about this one, though half the thumbnail are dead it seems
https://huggingface.co/lightx2v/Wan2.1-I2V-14B-480P-StepDistill-CfgDistill-Lightx2v/tree/main/loras
i2v light2x lora is out, localchads keep winning.
>>105924245You can use the Hiresfix and Detailer workflows from https://rentry.org/comfyui_guide_1girl
>>105924326Hang on, why is this 5 hours ago, didn't he release it yesterday? Is this a 3rd version?
>>105924271Yeah, they are, for some reason. I might refine the prompt/sampler settings on tagexplorer and also regen on multiple checkpoints, including noob.
the man in the center turns around and fires a gun at the helicopter in the top right of the image.
>>105924365first test but it's already better motion/action than my previous gens and thats 4 steps.
yeah, motion is waaaaaaaaaaay better than the t2v. need to test specifics though not just random action
>>105922414She's the most beautiful woman I've ever seen.
https://huggingface.co/lightx2v/Wan2.1-I2V-14B-720P-StepDistill-CfgDistill-Lightx2v/tree/main
the 480p one seems to work fine with 720p q8 despite that one not being up yet.
the man in the center sits at a computer and starts typing rapidly, causing the computer to explode.
>>105924402>the man in the center sits at a computer and starts typing rapidly, causing the computer to explode.is it really that easy to gen things with this video shit?
>>105924410yeah, i2v is fun for all kinds of meme videos
the man with sunglasses in the center sits at a PC desktop computer and starts typing, as large explosions go off in the background, causing the buildings to collapse.
holy pyro show
>>105924326what is the lora used for? am i supposed to use it with the distill model? i dislike that huggingface model pages never seem to have instructions
the man stands up and walks away.
from a kontext gen/edit. new i2v lora is very good and this is the 480p lora, no 720p yet but it works just fine.
>>105924527just add it to the rentry wan i2v workflow and set steps to 4. you get fast videos, it's like a turbo lora.
The man picks up his monitor and throws it behind him.
one more kontext gen -> i2v
The man turns around and enters the suit store through the front door.
motion is definitely way better than the t2v lora (expected, but it still worked fine)
Definately noticing more movement if it's in the prompt with the newer lightx2v i2v I mean, movement direction is more accurately expressed in the gen according to specifics.
In this gen it previously would hold the book close to the chest, with the new lightx2v it consistently holds the book out towards the camera.
A ran it a few times with the newer version and got the same, better, positioning of the book.
The prompt was different by one or two words on this gen, but none of the movement direction was changed.
Old: https://i.4cdn.org/g/1752078752809138.mp4
New: Picrel
The man reaches into his jacket pocket and takes out a black ski mask, and puts it on.
uhh, well it's a skiing mask, I suppose.
>>105924410beahahahahgahah
Can anyone provide me a good guide for lora training? I want to create one for a specific male anime character and there are no good ones on civitai or anywhere else.
What software should I be using? And how? Tutorial? I like the rentry guides here but the lora guide was last edited 2 years ago.
>>105923912> 1830that's a lot of gens to go through.
>>105924671>What software should I be using? And how? Tutorial? I like the rentry guides here but the lora guide was last edited 2 years ago.Nobody really knows, the most common way now is to use either onetrainer or kohya, find some lora you like on civitai, and use his training metadata for your own training
>they fixed self-forcing lora
nice. Goon game's restarted
When masking characters out for controlnet, is it important to also mask the objects they are holding? For example, if they were holding a notepad, and four fingers were behind it and only the thumb was visible. Should the whole notepad be masked?
>>105924402>the 480p one seems to work fine with 720p q8 despite that one not being up yet.yes
it takes only 80 seconds for 720p
if use your 480p i2v ligh2x lora for 720p wan video. There will be drop in quality right? or I'm doing it wrong?
>>105924642is CIA magneto?
>>105924849it still works, better than t2v in any case
>>105924542i think i am just a brainlet. are you not using the repo stepdistill model? is the only thing you actually want from this repo the lora, and you just set everything else up like normal wan?
the op guide needs to be updated
>>105924671Old guides still work. There are new optimizers however, but finding good settings is like pulling out teeth.
>>105922041>>105921785What checkpoint did you guys used for genning this?
My birthday's near, I'm alone, rarely go out, and i'm invisible to women. I'm selling my car to upgrade my PC. I have an RTX 3060 and a dual GPU compatible motherboard. Should I get another RTX 3060 for dual setup or invest in a more powerful GPU?
Improve my life or going to therapy isn't an option.
>>105925097Cool story bro. I'd go for a single, more powerful one. Even if your motherboard can handle it, I can only guess software is going to have a bigger issue, kinda like the fact our multi-core CPUs still suck ass, because most stuff is coded to be single-threaded.
>>105925097ddr5 ram 64gb min
>>105925097Can you install dual GPUs on a motherboard? What will the technician think of a man wanting to install 2 GPUs?
You need an excuse first.
>>105925097depends on what you want to do. I'd go for density and switch to something 24gb if you want to do bigger, faster.
if you want to multitask or have a richer experience with LLMs + generative (silly tavern w/ tts, image gen, etc), two gpus might be good.
>>105925146>What will the technician think of a man wanting to install 2 GPUs?>You need an excuse first.
>>105925097If you treat your birthday as special, then every other day sucks. It's just another day you're alive. You could have not woken up today, but you at least woke up alive, and with a chance to make your life better somehow. Therapy is a great way to get a third person perspective (must have for improving perception of reality), but don't forget they have their own mental illnesses too. Keep the car and treat women like casual NPC's until one stops to keep your attention. This is coming from anon with wife and dogs on acre in a city without a degree.
Here's a checkpoint from my Blue Archive wan t2v lora:
https://huggingface.co/quarterturn/wan-2.1-t2v-14b-blue-archive
It's probably done. I need to gen more and see how it adheres to the original captions. The trouble is with a dataset based on entire anime is accurately tagging characters. I gave molmo a "guide" in its prompt, and sometimes it worked, and other times not.
>105925114 >105925125 >105925133 >105925146 >105925162
Thanks for the help!
I didn't know multiple GPUs are better for multitasking.
I'll choose one powerful GPU.
Thanks, I'll delete my post soon but do a resume for another anons in same situation
Theme: Multi GPU or single GPU?
Data extracted:
-Preferably powerful single option. Even if your motherboard supports it, software will still struggle.
-For density, choose 24GB. For multitasking and richer LLM experiences (silly tavern w/ tts, image gen, etc), two GPUs may be better.
>105925217
>Therapy is a great way to get a third person perspective
I don't want to go to therapy for that same reason, I don't like the idea of thinking in the third person. Having role models about an โidealโ lifestyle is the prelude to other mental illnesses.
Married men and single men both have problems in life. I pay my taxes and in my free time I want to surrounding myself with colourful anime girls. As long as I suffer and feel pathos it means that I'm not crazy.
>>105924961I mean it works but the face is all pixelated. As opposite to old lora, very high quality pixels but no movement.
>>105925097>>105925304>Improve my life or going to therapy isn't an option.>Having role models about an โidealโ lifestyle is the prelude to other mental illnesses.kek'd
>https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Lightx2v
here we go
>>105925269BASED
we need a repository for this kind of Loras, isn't there any page that support that? MEGA? mediafire?
Should flowmatchsigmas be used for lightx2v?
>In June 2024, the author of Forge announced that he planned to rewrite Forge's internal implementation, which would break most existing plugins.
>He described the new Forge role as "an experimental project to test new features," and advised most existing users to return to the original AUTO1111.
Your answer Forge cucks?
>>105925483The UI is simpler than spaghetti interfaces. It's easier and more intuitive than pushing a graph around on a "playing field".
Testing out the new lightx2v. So far it seems a lot better than it was before.
I want to change from WebUI to Comfy, but I need these two things.
WebUI switching Checkpoints is exhausting. I have to go to CivitAI to enter the checkpoint name, select it, and check prompt format, samples, steps, CFG, and config every time.
1- In Comfy, can I save each Checkpoint config in workflow format (e.g. NoobAI.json, WAIv14.json) for easy load and inmediatly use?
2- Does Comfy support V-pred? How can I use it with NoobAI?
Optional 3- Are there any unique features in WebUI (samples, schedulers, extensions) that are not in Comfy?
>>105921138 (OP)>wan guideHow do you increase the quality for the wan lightx2v workflow? Do you increase num_inference_steps? Does it make a difference for lightx2v?
The rentry guide provides a lot of tips for this in the teacache workflow but not lightx2v.
>>105925650https://rentry.org/comfyui_guide_1girl#total-beginners-explaining-the-basic-text2img-workflow
I kinda feel bad for the lightx2v guys, they will have to train again another model for Wan 2.2 soon
>>105925677two more weeks...
>>105925650>In Comfy, can I save each Checkpoint config in workflow format (e.g. NoobAI.json, WAIv14.json) for easy load and inmediatly use?Sure, that's what comfyui workflow files are, I guess.
>Does Comfy support V-pred? How can I use it with NoobAI?You used to need a node to turn on vpred but now it's enabled automatically when you load a vpred model, I think.
>Are there any unique features in WebUI (samples, schedulers, extensions) that are not in Comfy?Don't know, but I doubt it.
>>105925677they can't re-use their existing datasets?
>>105925650>Optional 3- Are there any unique features in WebUI (samples, schedulers, extensions) that are not in Comfy?Not missing, but adetailer and regional prompting are worse in comfyui
>>105925718Depends on how Wan 2.2 will be
if the number of frames, durations and compatible resolutions change, they will most likely have to rework their datasets
Stop mooching and pay for your software and data.
>>105925677at this rate, wan2.2 would have to be seriously amazing before I switch. The quality of the results I'm getting from new light in 2-3 minutes are too good.
>>105925708>, I guess.>, I think.>, but I doubt it.
>>105925097>dual 3060sare you retarded? wtf is that good for? take your 3060 and throw it in the garbage and get a job
>>105925866why are you so gullible? make others work for you, or better yet, make something work for you, so that you do not have to pay it a wage
>>105925650STOP!
DO NOT SWITCH TO COMFY IF ITS NOT FOR VIDEO
-Not sure if your PC's good, but COMFYUI interface is heavy. 90% Itโll crash between gens and you will have to reboot the UI.
-Memory leaks
-Made for paid API nodes.
-The dev is a piece of shit who comes to get attention.
-That dev wants Loras to work only with Comfy because he thinks he can pressure others to comply, as he mentioned on 4ch few days ago.
-More philosophical but last year it adopted telemetry, backed by other AI startups, losing its prinicipal objetive of an integrating local UI and bloating the UI with unnecessary Comfy specific features.
>>105925885this is a tranny developing a rival ui btw
>>105925885this is based and truthpilled, heed anon's words
>>105925879oh you're a neet. kill yourself
>>105925887>abandoned ui vs rising api driven uiso sad
>>105925888I think light only works with 4 steps.
sad
md5: 25d8f4f4695dc6deb0aad602a7c1fe5b
๐
>>105925906Oh ok.
Are there any other quality optimizations you can make? Or is it a one-size fits all thing?
I'll be sticking with ComfyUI for the next few years thanks. No-one is going to be able to catch up for at least a year in terms of ease, usage and performance. Weekly updates too. Gone are the days of git pulling and the whole install is fucked, such as with other frontends.
>basically describe CEOs
>kys neet
kek
>>105925925disregard that retard. i'm pretty sure lightx2v is intended to be used at 8 steps but you can use 4. you can use more than 8 if you want too.
Why do American AI developers hate chinese AI developers so much?
>>105925943Foreskin envy?
>>105925269This is great. Is the dataset mix of images and video clips? Thanks for sharing anyways
A1111 boomer here, somebody redpill me on Vlad's SD next, is it actually good?
>>105925927>Gone are the days of git pulling and the whole install is fucked, such as with other frontends.>>105925650I'm this anon, how do I install comfy or update if it's buggy?
>>105925937the usa has gone all in on cloud ai so naturally they seethe when the chinks keep breaking new ground in local ai.
The seething about deepseek was especially fierce.
>>105925269shere it i BA general in videogames or anime generation in 2ch here are 3d sloppers
>>105925943Because they're releasing models for free instead of pay walling everything like a dystopian capitalist shithole.
>>105925966From the github. DO NOT INSTALL THE STANDALONE VERSION that you'd find when normally googling for comfyui. It's genuinely atrocious. Portable is a must.
>>105925960>A1111 boomer hereHere's another. Try this https://github.com/67372a/stable-diffusion-webui-reForge
>>105925981my question is why isn't the chinese government doing anything about it? you'd think they of all people would aggressively try to monopolize it.
>>105925960>>105925669Both UIs have potential, but key extensions for image quality and generation are missing. Important ones like HiresFix, CFG rescaling, Adetailer, Skimmed CFG, modern sampling, FreeU, self-attention guidance, and ControlNET are either absent or basic.
>>105925927Comfy itself doesn't ever break, but it does occasionally break some extensions when a dependency it uses updates and the extension requires a lower version. For example, the latest Comfy broke MMAudio because it uses Numpy<2.0.0 but the latest version installed Numpy2.3.1. Thankfully I got it working by downgrading numpy back to 2.0 without any issues
Poor boys lacking gpu, github fastsdcpu is worth a look. Images in seconds on arm64 or Intel with openvino support.
the new i2v light is fucking awesome, no more dim lighting or slow motion
>>105926009They're also being undermined. They were angry with DeepSeek because the developers managed to train Deepseek for under $1M while getting the same performance as OpenAI, meanwhile OpenAI and such are asking for $100M+ in funding. Now the investors are getting skeptical of just how much money these American Tech companies need if the Chinese are doing it for a fraction of the cost.
>>105926010>github fastsdcpui have a i9 12900k can i gen in seconds?
>>105925966There's a install manager, depending on the problem you can disable nodes, uninstall nodes etc, image your current setup and restore it if a new node breaks something.
I've honestly not had a problem i couldnt fix for about a year. Oh I tell a lie, sageattn fucked my install but tbf i was being haphazadly autistic when i did that.
Just make sure you have a virtual environment, install into that and back that up before you start adding all the goodies to play with.
>>105926027OpenAI create their dataset
DeepSeek ctrl C and ctrl V their data
37
md5: 7a68259d505abfa12612139119cdbaf7
๐
>>105926030>>105926010The memes write themselves kek
>>105926005This is fair, that does happen.
>>105926027>the developers managed to train Deepseek for under $1MI think this is bs, probably cost way more. They are doing this to have even more bragging rights. That being said Deepseek is great and deserves all the praise it got.
>>105926035Oh, but that's a lot of shit to consider! It's not comfy at all!
I think the extensions you are mentioning are for generating video.
What if I have 3 comfis:
one for txt2img
another for Flux
another for videogen
Can I have these 3 separate without having to do any of that?
>>105926027>YOU STOLE AMERICAN CODE FROM OPENAI! SHUT IT DOWN!
>>105926044"sree" means "to shit" in russian. kek
Wait do people really still use abandonware like reforge?
>>105926074because we want to avoid this
>>105926035
>>105926067Why the fuck do you need it separate?
Is he right?
>>>/pol/510536513
>All chink "open source" is just the CCP, and it is full of backdoors.
>>105926074why not? it still does everything forge does and faster.
>>105926082Whereas the US tech is trustworthy and secure...
>>105926074>ReForge>XnView>Gimp+darktable+g'micZoomers will never understand
>>105926002>>105926018>After all these years I will have to learn comfytrannyUIgrim
>>105926084The developer literally abandoned it because he felt it was getting worse lol
>>105926044Bloody bloody! Still I like guys project. Gets a pass from me.
>>105926067You can use all three methods of genning in one install in the 1 environment, it is fine.
>>105925937>i'm pretty sure lightx2v is intended to be used at 8 stepsIs it a power of 2 thing?
>>105925960Seems ok for Linux distros
>>105926002In terms of SwarmUI
>HiresFixincluded, just not called that
>AdetailerI think the object function does the trick
>CFG rescaling, Skimmed CFG, self-attention guidanceDynamic Thresholding is available to download with 1click
>ControlNETAvailable and easily accesibile
>modern samplinghas a fair couple options I haven't seen back in Forge
>>105926115literally not the case. he said he just doesn't have the time anymore. many such cases.
and who cares, as long as a tool works for your case and there isn't anything better out there.
i use forge, reforge and comfyui. all for different workflows. i hope this info makes someone deeply unhappy.
>>105926145Ok but Dynamic Thresholding is old tech, what happened with Skimmed CFG, self-attention guidance? and most important FreeU?
>has a fair couple options I haven't seen back in ForgeForge and ReForge activate it autoamtically
Can you generate an image with a pony and an illustrious together?
>>105926169looks like she's 'choosing' the viewer
>>105926174yeah, you can have setups where you start #% of the image with one model and then finish with another or even swap them after each step, but i dont think its really worth it and idk the nodes required
>>105926174Both are EPS SDXL model, so you can either merge them or swap half way
>>105926190does not work like that
>UI stopped updating after 4 months.
>Abandoned tech.
>Meanwhile I'm still on Windows 7 with updates off.
Zoomer or Zillenial issue? ReForge isnโt dead, just 4 months without any bloat. Itโs irrelevant.
>>105926074Ohhh If the dev doesn't release a new update every day and doesn't post in Discord or in /ldgt/ (local diffusion general tranny), the software is completely dead!
>>105926211wangblows doenst have any new features that you need but this tech has optimizations and improvements every month in some way for most models and every other day if you are using the newest toys
>>105926222anime themed please
>>105926124What about Python dependencies and compatibility of extensions you mentioned? I don't want to ask chatGPT every time this happens.
>>105926222pls no slop and cringe
>>105926115>stop using your hammer and use this needlessly complicated swiss army knife with functions you've never even asked for
>>105925650you don't need comfy for video. wan2gp via gradio webui works very will, gets updates and even now supports flux kontext.
https://github.com/deepbeepmeep/Wan2GP
>>105926005>Comfy itself doesn't ever breaknta but on a default install I practically have to send a manual sigkill every time to stop the process from running silently in the background eating up resources whenever I try to quit normally
>>105925997Under the Communist Party's rule, there are no private enterprises. They can easily be switched to the People's Liberation Army or state-owned enterprises with a flip of a switch. People in democracies must not forget this essence.
>>105926074because it's optimized, simple to use and gets the job done. It's not the complicated. Stop recommended shitty over complicated ui to newbies.
>>105925483What is your answer? I mean you're the only one who seems to read such garbage and perhaps that is because you're the dumb cuck using that garbage. You're the one that needs some sort of validation because they can't think for themselves.
this thread is full of npc's, its like watching a nature documentary of some hive minded creatures and it makes me physically and mentally ill like reading reddit does. zoomers truly suck.
>>105925866you're a fucking npc mate you probably say stupid shit like this in every thread, go take your tiny brain and have a bus run over it idiot.
>>105926245Python is backwards compatible, you can get away with changing the env python version, if not you have your backup. If you must have something that needs higher versions just make a new comfy environment. It's easily within your skills if you know what a dependency is. idk why you even asked.
>>105925885RETARD
NO ONE FUCKING CARES ABOUT YOUR SHITTY OPINION.
>>105926514calm down double-digit bank account
>>105926438oh yeah? fuck your ugly computer biiiiiish
>>105926674"She goes easy on stuff like glass that doesnโt affect performance, attacking it and later picking up the untouched pieces to take home.
>>105921621Interesting gen. Wish there was more like this posted here.
>>105921664This is cool as hell. I would put that on my wall...
>>105923966Awesome. You've convinced me to look into wan...
>>105928687i hope that's sarcasm
>>105926438>over complicated uiif you use workflows from other people and you don't even know how the nodes work then yeah it looks complicated. A lot more can be done with comfyUI and that is a fact.
>simple to use and gets the job done. I really do see how I could get the job done as you put it using such a feature lacking obsolete none maintained piece of fucking trash, but what ever makes you happy and more power to you. But know one here gives a fucking damn about what you use to get the job done, maybe you would be better off using some saas with a very simple interface instead?
go back to /sdg/ faggot you're shitting up the thread with pointless bitching.