Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>106123208https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanXhttps://github.com/Wan-Video
2.2 Guide: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y
>Chromahttps://huggingface.co/lodestones/Chroma1-Base/tree/main
Training: https://rentry.org/mvu52t46
>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
same asuka prompt, diff image:
>>106127742 (OP)Remember to help with the new wan rentry guide if you can:
https://hackmd.io/RDxlWe8mQCSUi72yUDEzeg?both
Anyone can edit. Still needs a lot of work because it's based on the old rentry. So far it's recommending Kiji and the AIO for wan2.2 workflow.
you have 7 hours left to submit for genjam:
https://forms.gle/cNvrZKnwEzfuAU7z5
I do not care for video generation.
>>106127785(except you can also submit after the deadline, same goes for genjam1)
I do not care for grapes.
Lossless >2x speedup???
https://github.com/philipy1219/ComfyUI-TaylorSeer?tab=readme-ov-file
https://github.com/Shenyi-Z/TaylorSeer
https://arxiv.org/pdf/2503.06923
Has anyone tried this? Does it work for Chroma?
>>106127690he's right tho, you can't stop the anime from yapping. If they have their mouth open they'll start yapping randomly.
>>106127814this just tells me your conditioning isn't strong enough. You're supposed to use the negative prompt too.
I have the ability to generate videos but I have yet to do so desu
>>106127836my problem with videos has been they usually don't really add more to an image. just some rudimentary movement that was implied anyway. but the last couple days, I'm now tempted to start because these wan 2.2 gens are significantly better.
How do you add loras to the kijai workflow?
>>106127899hootttttttttttttttttttttt
>>106127899Which Lora did you use?
>>106127785desu there is a very good chance i submit another gen to replace me previous entry
>>106127899nice res
i need to buy more ram asap
Sorry for possible offtopic.
Is there anything out that can be set up locally for audio/voice generation? Big plus if it can take context from or be included in the video generation.
>>106127916I'm a loramaxxer https://files.catbox.moe/oi0tgg.mp4 workflow
loras https://civitai.com/models/1730935/secret-sauce-wan-21 https://civitai.com/models/1755105 https://civitai.com/models/1585622?modelVersionId=1845678 https://huggingface.co/Kijai/Wan2.1-Fun-Reward-LoRAs-comfy/tree/main
It's upscaled with https://huggingface.co/Phips/4xBHI_realplksr_dysample_multiblur/tree/main (it's fast as fuck by the way) then interpolated with GIMM
>>106127979>I'm a loramaxxerwanschizo*
blue hair anime girl sits in a wooden chair and crosses her legs. she is wearing a white bodysuit.
frens
hug didnt glitch either, neat
>>106127814use NAG and put "talking" in negative
comfy should be dragged out on the street and shot
>>106127879in practice it's like 10% better than what we had before. it's good the old loras still work
anime girls wave hello to the camera.
neat that it works for multiple characters
>>106128138NAG applied,
talking, speaking, mouth open, mouth movement, in neg
she's still not keeping that gob closed
wan devs trained on tummy data too I guess:
Was there any discussion about the native comfy FLF WAN 2.2 support? Seems pretty good!!
>go to /v/
>look for a lust-provoking OP image
>see this >>>/v/717164343
>generate two semi-lewd wan videos and post them >>>/v/717165591 >>>/v/717168257
>some anon asks for sauce >>>/v/717183768
>tell him to check gelbooru >>>/v/717184008
>anon scavenges 120 pages of daisy porn on gelbooru >>>/v/717185426
>mfw
there we go, a simple prompt with a simple tag gets the obvious result:
anime girls in swimsuits turn around to show their ass.
kneel to china for making wan 2.2. local can be uncensored so it's better than the SAAS shit.
>>106128388considering you self reply often it kind of affects the lulz in a negative manner. you aren't cool or based. just annoying like any other avatarfag
>>106128431But you're a schizo, you don't matter.
>>106128426also, note the ponytails actually move around.
>>106128437your nickname is wanschizo. who are people going to believe?
>>106128431>considering you self reply oftenEvidence?
>>106128448No, that's just you attempting to project your own label onto others.
This thread has become quite hostile since that one anon started making WAN anime gens. It might be time to retire the ritual post.
>>106128473why are you trying to deflect away from your schizophrenia anon?
>>106128451every /a/ spam thread he makes he pretends to be somebody interested and posts the same replies every time. you can check the archives, it's easy to find since /a/ rarely has video OP attachments
>>106128482This anon is right.
>>106128451oh and this
>>106128489I doubt anyone on /g/ really cares about /v/tards
>>106128482>he makes he pretends to be somebody interested and posts the same replies every timeHow do you know it's the same person? Do you have evidence of this?
>you can check the archivesNo, you should be able to provide the evidence if it exists.
>it's easy to find since /a/ rarely has video OP attachmentshttps://desuarchive.org/a/search/filename/.webm/type/op/
Clearly not true.
>>106128501Anon, please take your meds. You are unwell.
>>106128505he posts the same exact videos every time. try MP4 dumbass. everyone knows it's supported now
even works with odd perspectives. very cool.
>I did not specify her clothes tear, just a red bodysuit
>>106128501>>106128520It's not worth bothering. He'll win any argument by being more willing to wallow in shit than anyone ITT combined and dragging anyone that grazes his ego in with him until they realize there are better things to do than fling shit.
>>106128520You are accusing anon of replying to himself. Again, where is your evidence? How do you know someone is replying to themselves?
>>106128527Sounds like you're replying to yourself here. Post time just past the cooldown too. Interesting.
Chroma Flash can generate some cool uncond (no prompt) images at times
>>106128527he kind of loses every argument when his ego just can't help but invent a third party to take his side. this is the wanschizo tactic. his gens suck most of the time anyways so I guess just filter or ignore from now on. at least you get it anon
>>106127715Clever workflow, thanks for sharing. Are 3 ksamplers needed?
Vramlets how are we coping with all the video stuff?
>>106128552Yeah. I'm sure most regulars can spot him by now.
>>106128552Anon, I suggest you vary up your posting style at least a little bit if you're going to reply to yourself. Does your autism make it difficult?
>>106128560badly. fuck this two model bullshit
I tried to get this working on my RX 9070 XT on WSL2. No luck. ROCm installs and rocminfo detects my GPU and rocm tests even succeed using my GPU, but if I try to use sdnext it uses my CPU no matter what. sdnext even prints rocm is turned on and my gpu is chosen, etc, but it still forces a CPU backend.
Any have some resources on how to build and spec out the RTX 6000 server cards? I have a large work budget and I got some slush funds in there I can buy something
>>106128538lets go to very distant lands...
>>106128574does sdnext swap to ram if you oom on the GPU like comfy?
>>106128591Probably idk. I saw some cpu swap settings. I gave up so I already uninstalled it all.
there we go, intact plugsuit.
>>106128582/lmg/ would probably be a better place to ask since most of them run multigpu servers
>>106128567it doesnt load all at once, I am fine with 16gb on the kijai workflow and the all in one model works on 12gb even if you wanna use that, which also works fine.
>>106128605it just sucks having to reload them constantly, really kills the speed
>>106128560we need an ANCHOR so someone animate our gens
>>106128612aio speed is fine, so is the kijai workflow. aio is 23gb but it does one shot gens fine, there are 2 models but it's just doing 3 steps for both (with the lora at 6 steps)
>>106128605>all in one modeldoes this have ggufs yet?
>>106128560Vramlets the "teehee I only have 12gb 3060 I can run wan2.2 and chroma on" or "8gb sdxl or at most lumina cope"? I'm the latter and sad.
>try anon's wan 2.1 workflow
>9999999999999999999999 errors
>see rentry tutorial
>it's a fucking book
I just want to generate videos bros why is comfy such a piece of shit
>it's a fucking book
RETARD
ok I watched +20hours of tutorials of Comfy, Where is the Adetailer?
>>106128621no need, it's 23gb but it doesn't load 23gb at once
https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOne
>>106128615The default is anon having the ability to animate his own gens. We are not beggars.
>>106128632https://github.com/deepbeepmeep/Wan2GP
the guide is a noob trap
>>106128560Anyone below 48gb is soon going to be a vramlet at this point. The guys behind wan have a shit ton of funding and it's likely that they're hardly even done yet with this stuff.
>when it's another marvel slop movie
>>106128652I am a vramlet unless you are talking about ram but then again I only have 32GB of that
>>106128632use WAN's UI
use Swarm
use SDNext
use dreamstudioUI
>>106128552he's done you good.
>>106128648>he fell for comfymeme
>>106128694You don't know who it was deleted by anon. Are you going to make schizophrenic assumptions again?
>>106128676>WOOOOOORDSI
WILL
NOT
READ
>>106128676Are these just alternative UIs that support WAN?
>>106128724>Are these just alternative UIs that support WAN?yes
>>106128648ComfyUI Impact Pack, but it has 200 nodes you will have to search and configure the right for you. No, I will not share mine.
>>106128724each one with different bloat and slops, have fun
>>106128740sounds promising
>>106128755it's over
>>106128747why do these assholes make a gorillion more nodes than what's actually needed?
>>106128560Pretty good.
t. rtx 2080 8gb
>>106128023
How do I into WAN lora training? I doubt it's booru tags anymore. Also can you train loras with Runpod?
>>106128762the artist of the rhapsody game sure loved thicc legs
So I asked GPT to create a .json file with a workflow for Deph Midas ControlNet.
It gave me some schizo workflow.
What is the solution to this node/ workflow problem? Don't know why Forge gave me python issues and now doesn't work.
>>106128795you have to get a vlm to tag all this shit for you. single images only makes it stiff and video captioning sucks so it requires a lot of manual captioning/correcting the vlm
>>106127899>>106128789cum on silvermans face and tits
>>106128789make a tutorial or tell me only the steps and I will ask some AI to fill in the middle
>>106128795if its not NSFW or suggestive you could use Google Gemeni for free that has the SOTA vision with 2.5 pro
LOL. she just wont shut the fuck up. The liquid emulation is fun atleast
>>106128813>So I asked GPT to create a .json file with a workflow for Deph Midas ControlNet.why? just use the workflow from the repo. I'd imagine llms have a shitty time making workflows because the redditors poisoned the well with retarded shit on civit
>Don't know why Forge gave me python issues and now doesn't work.have you tried asking gpt how to fix it? or even here? why give up after one error? expecting comfy to be stable is a dissapointment waiting to happen
man holding a shotgun runs down the office hallway and fires it several times.
kek, the motion for 2.2 is way better
>>106128217even if it's just 10%, that was enough to push it over the edge of a qualitative change IMO.
>>106128841Forge issues heppened after I wanted to instal WAN pip requeriments, I remember a WARNING message from Cuda, then I wanted to use ControlNET on Forge and it says me that something 16fp or fb must align with other thing that is 16fp fb, at first I fixed momentary with selecting a VAE, but then the PC start to ru n slow and have to restart.
I don't know man, I want stability in my gennings, or better sayd Stabiliy in my Diffusions. I want a Stable Diffusion!
>>106128871we take anything we can get, don't get me wrong. it is concerning it's getting to a point consumer cards must quant in order to do anything. I really hope there are just specialized foundational models in the future. having everything in one pile but only gives good quality to 3d and realistic is annoying
>>106128574use comfy. sdnext is a piece of shit
>>106128903they all suck. even comfy. tired of updates breaking everything
>>106128915Actually insane detail on this one
>>106128546Chromashizo... or rather, ChromaDev:
Stop shilling your model.
I know you worked hard to create a good NSFW checkpoint, but now the Chinese have come along with something much better with less the effort.
Don't worry, this is the fate of all local model enthusiasts, they work hard for the community, and then a corpo with more money achieves in two days what you had to do in 80 with hard work and poor sleep paterns.
Let it go, forget about Chroma, you learned a lot about Difussion models, you had fun, you met new people, new experiences, but let it go, is not healthy for you.
You lost.
>>106128892im on a 3060 and im only using the fp8 scaled quant 170s per video.
when wan came out i was coping with wan 1.3b over 170s per video, optimizations are needed
technology shouldnt be limited by consumer hardware, eventually everything trickles down
>>106128838>still not using the negative prompt
>>106128951I forgot lora on. User error like usual
Ok thats it, I will do what I had to do time ago, I will delete the venv folder and git pooling again, Comfy it's a piece of shit.
Wish me luck, anons.
>>106128546chroma flash nunchaku wen..
do we have an experience to train wan loras for t2i? wan 2.1 loras are able to introduce new concepts, WAN seems to be quite flexible.
>>106128973nice is that kijai wan wf? All outputs are looking clear and nice lately, Im getting shitty noise with native lately
>>106129001tags are good, and I think that WAN can handle it well, tags and prose to complement abstract concepts or compositions its the future
man drinks a glass of whiskey in a chair.
>>106128985venv comfy? embedded?
>>106128823see
>>106127979>>1061288202.2 cumshot lora is pretty meh, gonna try the old 2.1 one later https://files.catbox.moe/v0ar35.mp4
>>106128940Oh no! Anyway, I do hope all the loras get properly retrained for 2.2
>first last frame workflow actually worked on toaster pc
it's over for my free time
>>106129028>catbox is down nooooooooooOOOO!!!!
>>106128943>RREEEEEEEE STOP HAVING FUN!!!!>This huge 14B model with slow inference, slopped image outputs that no one will do a full rank fine-tune on due to being expensive, that people are currently only using for videos. totally mogs Chroma bro, why use it at all?
>>106129016slightly modified version of this what anon linked previously
>https://files.catbox.moe/kml6a4.mp4
This pace of progress scares me. One could already blackmail random people with fabricated videos.
a character in a green dress walks out of the potion shop.
neat, it picked up the character just fine.
>>106128940nah, she's been cancelled for blackface and playing a homophobic character in a movie
>>106128892>it is concerning it's getting to a point consumer cards must quant in order to do anythingthis is just because the models and training/inference code are bloated and unoptimized. all of them. we're going to see many 10x speed, memory footprint, and quality improvements yet.
>>106128914just use linux and a venv. I have never had comfy break, and I'm on AMD which is supposed to be buggy and have poor support.
>>106128834It is nsfw. I wouldn't bother otherwise.
>>106129047People are literally making more and more WAN Loras than Chroma in a disproportionate way. What are you talking about?
Let it go, Chroma dev...
>>106129028Your loras are terrible, wanschizo.
is pony still the furry king?
anyone tryed inpainting with SD3.5 Large? it respect the style? how about objet or bodypart composition?
>>106128903I can't be bothered, I have better things to do anyways.
>>106129183Yes as base model, no as actual model, there are millions of better checkpoints now
Do any of you generate videos on your PC while using Tensor, PixAI, or SeaArt to generate images in the meantime? It's quite dynamic and fun this way.
>>106129205no, I'd have to turn in my ldg card
Reminder that the end of local generation is coming. The big cloud providers know it's a threat and they are taking provisions to shut down local. Forcing sites like tensor and civitai to remove nsfw content is just the first step.
>>106129127>we're going to see many 10x speed, memory footprint, and quality improvements yet.I highly doubt it when all people know is python. the underlying code for pytorch never gets touched, only the shitty pyshit. python is the greatest menace to the tech and it's only breeding more retarded pyjeets. we are stuck in prototype limbo until something changes. I thought llms were supposed to deprecate high level language not make everyone stupid as fuck yet here we are
>>106128789But I don't understand, is it that easy, is that legal?
Without consent? So you can take a photo of a 16y old girl and prompt the same thing and she'll have the same fate?
>>106129230the fate of having a digital doppelganger being virtually molested? yeah
Maybe someone has done it to you already, you have no way of knowing
>>106129191so you can't be bothered to use the thing that works, just to waste your time on the thing that doesn't work. ok
>>106129219I don't care, I already downloaded enough models to last me a lifetime.
>>106129228>I highly doubt it when all people know is python. the underlying code for pytorch never gets touched, only the shitty pyshitpython devs are a significant part of the problem, but won't prevent optimizations. there are smart Slavs and Chinese people who want to run this stuff on their 3rd world tier GPUs, they will figure it out.
>>106129266I already wasted 2 hours on the thing that didn't work. I don't need generative AI anyways, I would've just tried it but whatever.
>magazine has suggestive pic
you will do as I say, woman.
>>106129244Wow, in the virtual realm, I think we're going to be totally desensitized.
>>106129266>I don't care, I already downloaded enough models to last me a lifetime.Except there will be a point where progress officially stops, and you'll start yearning for cloudgen to be up to date. It will be censored, but it will be all you've got.
>>106129284I'm gonna give you the secret to tackling deepfakes, once you know this then any power they had will be instantly gone, are you ready?
they're not real
I can say that because I want you to be critical, I want you to question things, especially the media, but your handlers don't like this idea, they want to do your thinking for you, decide what's real and what's not, so you've got to consider who has your best interests in mind
>>106129266>but won't prevent optimizationstrue but it will prevent the 10x you are wanting. lately it's been snake oil bloat like rad attention and
>>106128492. don't get me wrong though, mathematicians from China are ridiculously smart, they just don't know how a computer even works to make the most out of it
>>106129266>I don't care, I already downloaded enough models to last me a lifetime.i would probably be okay for the rest of my life or at least a good chunk of it (im still young!)
>>106129291Thanks for the message, anon.
>>106129304>mathematicians from China are ridiculously smart, they just don't know how a computer even works to make the most out of itfacts. no good software engineer came out of China. I can't even name a good application they made other than tiktok but that is barebones when you get down to it
>>106129244To have unlimited access to all flavors of cunny for 5s it costs you $4k in GPU.
>>106129279nice, I remember that pic
Tried all the workflows. Nothing will reliably keep an anime character from constant taking. You just have to get lucky. it's over.
>>106129279blonde woman takes off her skirt to reveal black panties.
anything is possible with prompts.
>>106129151Most "Wan loras" I see people using are meant for porn videos. Maybe in your imaginary world people are abundantly training loras for artstyles, characters, etc like they used to with Flux. Good luck convincing poorfags to use Wan for image gen for like 5% better images while waiting 200% more and taking 3x more VRAM.
You may not like it, but Chroma gens are "good enough" for most people plus it's easier and cheaper to train loras for.
Don't get me started with other facts such as that Chroma is completely unslopped out of the box, unlike Wan, and can do things like dicks and pussies, unlike Wan.
I greatly appreciate Wan as a model and I think it has a great future ahead, but you are deluding yourself if you think it will ever be as accessible as Flux, Chroma and co.
>>106129395And btw, the reason you don't see Chroma loras often is because the model is not fucking finished training yet, and I don't think Civitai has added a slot for it yet
>>106129433It's under "other" also some Flux loras work on it
>>106129287you don't get it, I do not care. I would be content even if I was stuck on sd1.5 for life, I would suck it up and learn krita and do i2i and inpainting. Noob is a miracle of modern technology, I don't really care if NovelAI has better prompt comprehension. I can do whatever I want with noob if I put in some effort. it's multiple entire boorus condensed into a 6GB AI model. I also have chroma. I have controlnets. I can make all the memes I want, I can make comics, art, anything.
>>106129324exactly
>>106129304just look at the amount of blatant mistakes and fuckups there are in the current models we use. how Chroma was able to prune 2b params that were completely useless. or that unfucked VAE technique that speeds up training by 7x (I forget the name). there are already a lot of speed and quality optimizations that are published but not used because the tech is moving too fast.
>>106129447>some Flux loras work on itThe weights are completely different, I know some people made hacky things to make it "work" but it's not nearly the same as a native lora, plus again, the model hasn't finished training, a Lora trained on epoch 38 will not work as good in epoch 48, that's the reason you don't see a huge Lora community around it yet
So the age of pics is over? It's only gonna be videos from now on?
>>106129494Do you watch streaming or stare at paintings in a museum?
>>106129395Chromaschizo...
For seven days, people have been posting WAN videos and images. Nobody is posting Chroma gens except you, or maybe to compare them with WAN and we already know the verdict.
Generating an image with WAN only takes one frame. It's five times faster than Chroma and uses fewer resources.
Also, why are you so offended?
Didn't you compare the amount of Loras for Chroma and WAN in Civit?
Making Loras for WAN is easier because it handles character consistency better over time.
You can basically make Loras from WAN by prompting the character to turn the camera around 360 degrees. We're talking about a model that can think in terms of time and spatial composition.
It's over for Chroma.
But if you want to insist, go for it.
Have fun, It's your creation.
Maybe I would feel the same way if I put effort into something that was completely obliterated by a Chinese product.
blonde european woman playing volleyball turns around and waves hello
reverse creepshot!
>>106129464>it's multiple entire boorus condensed into a 6GB AI model.I agree but I'm also wondering how many more anon could fit in there desu
We need a mesiah, someone that make a script to trasnform .sdxl Loras and checkpoints to .wan22
>>106129494You can animate Chroma gens and then animate them! They are cool!
>>106129547why? there is no motion vectors in sdxl
>>106129572But copyrighted waifus and copyrighted styles matters!
>>106129494I'll take a nice-looking pic over those sloppy stiff videos any day of the week. Hopefully all the pic genners don't get too discouraged to be driven off here completely.
do we have a source that documents the modding of a 4090 to 48gb?
>>106129395Finetune WAN then.
>>106129503>For seven days, people have been posting WAN videos and imagesOf course retard, we just had a new release (Wan 2.2), people are trying the new thing
And you just proved my point, 90% of the gens posted in these threads are just horny I2V videos, not artfaggotry or 1girls, for that people are still using Illustrious, Krea or Chroma itself
>>106129503>You can basically make Loras from WAN by prompting the character to turn the camera around 360 degreesNigger, who cares about this, lmao
Yes, Wan has a huge spatial awareness, but people are here to make pretty pictures, I am not talking about videos and you are raising videos as a point for no reason
>It's over for Chroma. In fact it's only getting started
See you at epoch v50 when we see a flood of Loras here and on Civitai, lmao
wan 2.2 is better at motion and movement in general.
>>106129604Never, and if you ever see one, it will likely be one of the SaaSfags making one and paywalling it.
Wan is a much larger model and more expensive to train.
If by some miracle someone fine-tunes and releases it for free, it would be a random chink lab and the model would be censored.
>>106129631Anon, im cringing for you, please stop.
>>106129527is that a camera flash or did a fuse blow
file
md5: a03153478bd85ad53030cc1485f3a0d0
🔍
https://xcancel.com/LodestoneE621/status/1951854210544541859
>Seems like with a little tweak the flow trajectories can be rectified to nearly straight lines without using reflowing procedure or OT coupling. This unlocks the possibilities of few steps model or even one step.
>>106129610>>106129631I understand, you put effort and money into it, have fun, it's your ride at the very end :)
>>106129667Just out of curiosity. Show me your "amazing" Wan IMAGE (not video) gens. Also, show me the style loras you trained for it.
Please not I am not talking about videos, I am talking about image gen.
>>106129661so does he have to start over now?
>>106129661>more lodestone schizo "research"This retard still doesn't understand the optimal transport pairing bullshit he's doing for Chroma training 1) doesn't do anything because of high dimensional spaces and 2) in low dim space where it does do something, it causes training-serving skew in the conditional case. And that's just one of the many things he's doing wrong in the training setup.
He is the textbook definition of a pseudointellectual. I don't know how more people don't see it.
Apparently someone got radial working ( I think ) and actually posted the resolutions. Looks like if the error pops up, you need to restart comfyui
https://github.com/woct0rdho/ComfyUI-RadialAttn/issues/5#issuecomment-3148588112
>I've noticed that once you get the not-divisible-by-128 error, you must restart Comfy for things to behave reliably again.
>1280x720 - 93 frames, 61 frames
>1280x704 - 85 frames
>1152x656 - 61 frames
>1088x608 - 125 frames
>1024x576 - 81 frames
>960x544 - 125 frames
>>106129565Wait till they hear about gta6 pricing
>>106129766Radial has always been working , it's just ass to use.
>>106129766ok but does it speed things up without bloating like the other anon that got it working and ended up dropping it?
>>106129565Is this just vanilla Wan 2.2 or do you need special loras to get good looking pixel art/cartoon animation?
>>106129761Anon, since you are a genius who likely works as a top researcher for a leading AI company making more than 1M USD / year, can you please train a model better than Chroma and release for us for free? That's nothing for you, right?
>>106129798just 2.2 with kijais workflow which uses the i2v lightx2 loras, nothing else.
>>106129712NTA, watch the threads, nobody is using chroma, fuck off!
>>106129761The main thing is most know nothing about training models so it's mostly a matter of who sounds more correct
>>106129840>lodestone is doing X, but it's incorrect, you can mathematically prove that it doesn't do what he thinks it does>"umm acktually sweety, where's your model? that's what I thought bitch. therefore he's right and you're wrong Q.E.D"you lodestone dicksucks are actually insufferable holy shit
file
md5: 932f40394f8470e6fcd319dbe7031ffb
🔍
>>106129712nta, chroma is okay but fails at multi subject images or any anatomy that isn't close up
i tried to generate pictures of bands playing and when i zoom in on the faces it's all horror
>>106127785>Google honeypot 2
Please help.
What the fuck is wrong with technology man, I've been genning shit all morning. Leave to have breakfast and now it won't work.
>>106129897>comfyfound your problem
>>106129876I never said he was right anon. I am just saying that, since you are so much more capable than him, can't you give us something better? I am sure you can, stop being such a lazy cheapskate!
>>106129794Only worked once for me, kek
>>106129796I havent had the chance to fully test it out without it shitting it self. The one time it worked, it was normal speed :/
>>106129958iirc radial wasn't about speed but a step towards more coherent longer gens.
>>106129885>or any anatomy that isn't close upThat is every model though
>>106129911>I never said he was right anonThen we agree, for the most part, I think.
I'm simply trying to point out that so many people worship lodestone like he's a god and it makes no sense. "Wow, look at this genius. Look at these graphs and charts he made. He figured out a new way to do rectified flow matching that will let us make few- or one-step models! Incredible!" But the reality is that this guy provably has no fucking idea what he's doing, he's just rich and able to spend the time and money to finetune a huge model like flux.
man in a white suit points an ak-47 to the right and fires it several times
never killed anyone btw.
Wan 2.2 mogs Flux Krea so hard is not even funny.
Get fucked.
Fuck those german puritan niggers.
man, wrangling noob for schizo shit is frustrating. dunno how that one anon does it. my genjam submission will be in soon
>>106130034I had a couple of ideas but I can't for the life of me get lain to stand on an exposed brain
>>106130029I used to go to boorus to fap, but I haven't done that in >2yrs. Instead now I go there to research tags and artists.
protip to any who haven't found this already:
https://danbooru.donmai.us/related_tag
>>106129842You realize the anon you are replying to (me) is one of the people posting Wan videos and talking about Wan in past threads, right?
And you realize that didn't make me emotionally attached to it or made me stop using Chroma (or Krea, for that matter), right?
And that people have not been using Wan as an image gen replacement to Illustrious / Chroma / Krea, right?
>>106129985>he's just rich and able to spend the time and money to finetune a huge model like flux.Please make your life goal to get rich, train models for us and mog him, anon
man in a white suit holds up dual silver pistols to the right and fires them several times
2.2 can do anything desu
>>106130055I just use this nowadays since you get a better idea of what tags actually works. have a big brain lain
exponential scheduler for anime wan genning makes it looks like weird watercolor with the shit it smears on the pic
>>106130082forgot link: https://tagexplorer.github.io
>>106130029>wrangling noob for schizo shit is frustratingvery much worth it tho
Is it acceptable to ask for generations when you're a generationlet?
>>106130073man in a white suit holds up a Miku Hatsune plush doll with his hands.
>T5-XXL vs T5-small with adapter
No text but the frog is nice
Training another adapter with more layers
>torch._inductor.exc.InductorError: AttributeError: type object 'CompiledKernel' has no attribute 'launch_enter_hook'
great, the wan 2.2 autoinstall bat broke the wan 2.1 workflow...
>>106130189try disabling torch compile node if you are using it or https://github.com/woct0rdho/triton-windows/issues/137
>>106130181Would be interested to see if someone can generate a picture from a top view, showing 10 women (a row of 5 left, a row of 5 right). They hold hands, ideally with the woman facing them in pair. And when that bed of hands is made, a new woman enters the scene and delicately lies on the hands.
Dang that's an accurate shadow
>>106130213picture/video, whatever is possible.
wan is repeating the history of stability ai
wan 2.1 - sd 1.5
wan 2.2 14b - sdxl
wan 2.2 5b - sd3 tier fail
there will never be a good video from wan again
>>106130253You skipped SD1.4 that came before SD1.5
>>106130207reinstalling triton worked, probably not a good idea to have the pre-release version in the autoinstall. Thanks
IS seedvr2 better than topazAI upscale?
Persuade me to NOT get a 6000 pro blackwell
>>106130097>>106130082thanks for the link. let me add, the related tags search on danbooru is very useful if you're trying to build a prompt for a specific style or pose. for example, say you want an artist who is good for the oil painting medium, you search for artists who are related to oil painting.
I'm catching up on things, is Wan2.2 worth setting up if I want to generate big bouncy boobies with I2V? Do I have to jump through many hoops to upgrade?
>>106129079Actually the opposite, you could have direct video evidence of a crime caught on tape and they could just claim its AI. Maybe not forensically but in the court of public opinion anything can be AI.
>>106129610>See you at epoch v50 when we see a flood of Loras here and on Civitai, lmaoI wish they manage train away the Flux face
a white cat puts on a top hat and starts dancing.
is this real chat?
>>106129610HAHAHAHAHAHHAHAW
You are so fucking delusional is amazing at this point.
>>106129459how did you stop the girl from constantly talking in anime style?
it's a major problem in my gens
>>106130253Except they have a shit ton more funding and aren't run by safetyslopping troons. Stuff like that matters.
>>106129761>He is the textbook definition of a pseudointellectual. I don't know how more people don't see it.Sure but:
- he actually released something that mostly works.
- no one else gives a shit/has the money/has the knowledge to do something better.
>>106130376this is why the jews are just openly committing genocide and bragging about it now. when they're done they'll generate a bunch of fake AI videos of jews committing war crimes and then say look, all those videos of us committing war crimes were faked by antisemites.
>>106129897reboot?
reinstall comfy?
>>106130359np anon.
>>106127785gen submitted otherwise I'd just be indecisive for hours. I hope there is a lot of submissions at this point!
Raw 1080p without ooming is crazy wan.
>>106130359i find it good for characters too i.e. what things they wear etc, if that info isn't on their related wiki page
>>106130488I'd hate to tell you this but your gens suck. wan needs a finetune to not look like generic slop and evil neurosama is lame because it's an llm running on someone else's hardware. get your own ai waifu already, it's completely possible now
For Wan2.2, is there no way to reuse the same lora nodes for both High/Low models? It seems very inefficient to duplicate them both.
>>106129891You don't need an email
https://files.catbox.moe/hinhyv.mp4
>>106130094>>106130222>>106130488Are these the power of the "Chroma killer" Wan text to image generation? Because even the fucking SDXL-based Illustrious look better than that
>>106130517>wan needs a finetune to not look like generic slopNo anon, you are wrong! The anons in this thread already stated the FACT that Wan image gen is the be-all and the end-all solution, no need to perform a million-dollar fine-tune to train the slop off! It's perfect as is
where do you post girls with dick nowaday ani?
>>106130597Do you not get tired of seething about everything all the time?
>>106130597the root of Chinese models being weird at t2i is a) too much synthetic slop in the datasets and b) very shitty captioning by esl retards who don't know video or image terms
>>106130222>that's an accurate shadowtrips and confirmed. nice.
>>106130465Heh I did both before reading your reply, no dice.
All-in-one keeps working, but I want to keep using Kijai's.
>>106130597>People using tool with no specifications is the best tool can doYou are so fucking retarded no wonder you are trying to defend chroma, probably because you are that furry retarded nigger training that garbage.
https://old.reddit.com/r/StableDiffusion/comments/1mec2dw/texttoimage_comparison_flux1_krea_dev_vs/
Want shits so hard on your trash down syndrome trannytune that will do generational damage to your non existent bloodline.
Lets talk about Comfy and how bad it is please.
>>106130614always /d/ when I have time but the institute of foundational models wants to chat with me. I could be making robot brains soon but we'll see how it goes
how long does it take to gen a decent wan video? is it worth spending a little money on runpod to play around with it?
>anyline lineart lineart_anime from controlnet_aux does not work with a 50XX
GODAMNIT nothing else is as good
>>106130651I'd still use the speedups like the distill and caching so you get the most out of your time. it's pretty easy
it's okay to lewd republicans right? it's probably not written down as law but that's the impression I get
>>106130660how long for a decent gen?
>>106130630No, I would summarize the problem as:
1 - They don't release the pretrained models, only the dreaded "aesthetic fine-tunes". Unlike the LLM scene, where the labs release both the pretrained weights and the instruct fine-tunes.
2 - they do use synthetic data in those fine-tunes, probably a lot
3 - They use synthetic captions for everything instead of paying people to manually label a subset of the data, in a way regular humans would prompt
>>106130672480*832 video about 1-2 minutes on my 4070s
>>106130686hmm might be worth a try, if you wouldnt mind, can you cite a decent t2i example at those parameters?
kekd
md5: 69dadad732273712a54821d0d2f5e6c1
🔍
>>106130638>ayo dis nigga got fake tensors
>>106130692Fuck you anon I keked. Now HELP ME
>>106130682yes, #1 is something I missed. very true. BFL is even worse since it's a student model as the open source release as well with a very shit licence
>>106130696kek, wish i could help, i never use kijai wan nodes however, it kinda reminds me of this error https://www.reddit.com/r/comfyui/comments/1jnaujh/sudden_triton_error_from_one_day_to_the_next/#lightbox
where I had to delete a torchinductor folder but might be different for you
C:\Users\USERNAME\AppData\Local\Temp\torchinductor_USERNAME
>>106127742 (OP)>top rightnyanners?