Age of Torrents Edition
Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>105870663https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows/home
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>ChromaTraining: https://rentry.org/mvu52t46
>Illustrious1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/celeb+ai>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
>>105875122Made the new as I was typing this.
I won't be posting here again. Just please stop polluting this board with this crap.
and nothing of value was lost
>bitches leave
Gen it pls, best movie
>>105875136Techlet genneral
Sloppers general
SDXL stagnant general
I will keep generatig images fo donald trump vs pepe the frog with kontext and doing img2video of slopy trashy celebs like taylor swift.
Stealing your Sabrina idea. It's mine now.
Why do I get a black empty image every time I use a scheduler other than beta in ksampler using flux.
Same thing with the sampler, most of them just don't work and produce a blank image.
Is it a resolution thing that throws it off if it's not a specific size of latent or what?
Euler/beta and Euler/simple seem to work (simple not always) and the rest just don't produce proper images at all.
>>105875103 (OP)Could someone re generate that top right video and replace the Indian dude with a white twink and have the girl swallow him?
>>105875122comfy posts here more often than reddit so who cares.
>>105875103 (OP)Is there something like llama.cpp that lets me run this on my CPU?
Blessed thread of frenship
>>105875205Anon posted the lora a few threads back
>>105875318What's the fun in that?
i just got back into the local scene for wan 2.1 and is that shit basically magic or what?
anyone got any opinions on the 720p model? seems like it got a little bit less support, does everyone just gen at 480 and upscale or is that GPUlet behaviour?
>>105875328480p looks fine and it's much faster, I'm not trying to generate 4K clips.
stagnant thread of saas sloppy seconds
>>105875136get thicker skin
>>105875427its not that serious
anyways, back up your files anon...
...anon??
>>105875328I prefer a specific finetune + loras
But overall itโs been pretty fun
Which is better/less dambooru nerffed?
Using the SDXL base that understands natural language and tags + 10 Loras, or using Civit AI's autistic checkpoints that know nothing beyond the scope of the tags?
>>105875103 (OP)wow never made the thred image before
>>105875327hope this guys still here, I was busy with work
https://files.catbox.moe/w8ef79.png
I have an RX 9070 XT 16GB.
Is there any possible way to run an img2vid model somehow on this machine?
>>1058754771 model, sloppy, other model, better, image, and shapes, accuracy, sacrificing, creativity,
>>105875458>>105875350yeah that's kinda what i figured thanks. prolly just try it on runpod for a bit to get a feel myself
>>105874068Requesting catbox of this gem. I will cry if you say no
>>105875509Please, anon. I beg you. Just point in the right direction ;_;
>>105875546Sorry I'm stupid, thank you though
>>105875483The baker chooses the yuckiest ugliest ones on purpose
hey hey ho ho
baker is a schizophrenic toad
hey hey ho ho
>>105875553that's why we're all here
>>105875554picrel its you when you're mean to me. pull one or both triggers
hey hey ho ho
schizoid baker has got to go
hey hey ho ho
>>105875578finally I've been waiting like 4 hours for that part
Do any of you guys use runpod, or are you running the models locally?
>>105875492linux only and not with a ton of tinkering. when/if rocm comes to the 9000 series it will be much easier.
>>105875568>tentacles>>>/h/
>>105875604locally obviously. i'd never run my stuff on someone else's computer.
>>105875604i've been meaning to swap to runpod for a while since electricity costs are so high over here anyway. also i kinda like being able to use my PC while I gen.
started looking into it and seems pretty easy? like at worst you'll have to ask an LLM
>>105875661they're business tentacles. This isn't some perversion, it's a routine exorcism and they are important for business operations.
>>105875670i mean unless it's illegal stuff aren't you fine? runpod let's you do NSFW stuff and is at least claiming privacy, so what can really happen
>>105875689>he wants to make chomo shid like the baker posts in other threads/generals
>>105875682like my electricity? 45c/kwh
>>105875696i don't even know how to feel about that yet desu. better AI than the real stuff at least. anyway, yeah. doubt you'd use cloud for that
>>105875604>>105875672the biggest pain point is setting up your environment and downloading models/loras/custom nodes each time you start up an instance. You can script it all out but it still makes a ramp up period when you start an instance.
>>105875722i thought network volumes fix that, idk haven't tried yet
>>105875682It worked out, she's on vacation now
>>105875751Bonus points for Pocari Sweat
>>105875736costs money to store stuff, and on runpod you can only use network storage with the 'secure cloud' servers which are usually 2x cost.
>>105875777shit's so good. I heard it tasted like how water tastes in a dream so I bought some on Amazon and now it's twice as expensive.
also checked
hey hey ho ho
schizoid has got to go
hey hey ho ho
>>105875804ah well, might still be worth it, i'll think about it. also there's other providers that are even cheaper. just looked at vast.ai they got 5090's for 60c/hr. i'm in europe so accounting for the weak dollar that comes out to paying 5c/hr for me, really gotta look into that.
>>105875870it is worth trying, way cheaper than api calls. I did it for a while when my second system was down because I enjoy genning while I do other stuff on my main pc. Getting a good setup script was the biggest thing for me, once I had a 'open instance, run script, just wait' pattern it felt way accessible. Just remember to filter for 1gbps+ network.
>>105875898thanks I'll keep it in mind
0
md5: 6a6fcdfa29c674d717787d871931d5b0
๐
>>105875729>>105875808>>105875839why is the indian immortal? is this a message?
>>105875136Nothing is happening in local image gen right now except everything getting banned. What is there of substance to talk about?
>>105876108look at the amount of people being born worldwide, indians and their diaspora will be pretty much immortals
>>105876116>Nothing is happening in local image gen right nowIncorrect. Anon is still making kino gens.
I'm using comfy, illustrious, and wan predominantly.
you guys have opinions on better options on the horizon?
>>105876141Chroma finishes training in about a month if I'm remembering correctly, Wan 2.2 is releasing sometime soon supposedly. That's about it.
pop
md5: 2268ffbbd7df127e8b5311cadf4b21dc
๐
>>105876132The future is much worse than you think my friend.
https://www.populationpyramid.net/africa/2024/
https://www.populationpyramid.net/india/2024/
>>105876116Radial Attention
>>105875547Hey, what similar video was this trained on? This is for research
>>105875328>seems like it got a little bit less supportnot sure what you mean. i've been posting vids genned at 720 natively. it just comes down to what your system can handle and how long you're willing to wait for the gen. also, increasing the resolution might force you to decrease the length due to ram requirements, so you can decide if you want a long gen at low quality or a short, high quality one.
>>105876180bruh can you keep the demoralization on /pol/
>>105876108I couldn't destroy it, I finally gave up, I'll try first and last frame later
>>105876320>muh pol boogiesanon pls
keep your fingers in your ears anywhere\everywhere else kek
>>105875751i DO like it ;3
you're cute mister!!
>>105875170>the cutting-edge news has no valuewhat did he mean by this????
>>105875313>a friend loves at all times
>>105876291just meant some specific LoRas don't work with it? or not as well, as I understand? I've been doing some stuff locally but only 480p and now i just want to throw compute at it.
720p doesnt take that much more VRAM does it? so i guess i'll just rent a 5090 and not an H100
>>105876394I dunno, 720p loras seem to work for 480p and vice versa. Even t2v loras can work for i2v i guess. It's weird. If you go on civit and look at the wan loras, some of them have both 720 and 480, but then you download them and it's the same exact file. Who the fuck knows.
>720p doesnt take that much more VRAM does it?WRONG
>>105876348that's it. you're grounded, young lady. stay in your room, and stop looking at me with that stupid expression
>>105876432>I dunno, 720p loras seem to work for 480p and vice versa. Even t2v loras can work for i2v i guess. It's weird. If you go on civit and look at the wan loras, some of them have both 720 and 480, but then you download them and it's the same exact file. Who the fuck knows.i heard it's something about them being "less strong" or something? idk.
and fuck, how much VRAM do you peak at? i guess i'll need the H100 after all
>>105876451forgot to mention, a 5090 is fine unless you want more speed
>>105876116>https://huggingface.co/NewBie-AI supposedly expanded neta lumina to 3.5B parameters and switched out to using gemma 3 4b and jina clip>https://github.com/NeuroSenko/ComfyUI_LLM_SDXL_Adapter
>>105876487is a 3.5b model still gonna be retarded even when specifically trained for it? I'm hopeful but skeptical
>Back From Deployment
>40 days later
>Nothing ever happened
>Still no /ai/ board.
Im gonna keep it *wholesome* for the jannies because I reached my goal of having a million dollars liquid yesterday.
>>105876608>>40 days later
>>Nothing ever happenedfeels like a lot happened in the last 40 days
one word prompting is insane
>>105876320despite what you think it's good to know the world you live in.
i dont know how we will manage to produce enough food for all these africans so maybe the problem will solve itself.
>>105876647I know full well. I don't want to hear about it every fuckin minute.
>>105876608bro, shit load has happened in the last 40 days. big nag. big kontext. big bellies. we have it all now. can't be stopped.
>>105876782I understand but you must understand I didn't post it just for you.
>>105876647your post made me think of the Documentary Dive. i miss early Netflix and the cool niche underrated shit they had around every corner.
Where do you connect the Load Checkpoint node in the rentry guide's wan i2v workflow? I was told yesterday it should replace the UnetLoaderGGUFDisTorchMultiGPU at the bottom left of pic related. Is this true? And if so, does that mean the UnetLoaderGGUFDisTorchMultiGPU parameters are not important?
Also what about the clip and vae pins from Load Checkpoint? Where should they connect in this workflow?
Where do you connect the Load Checkpoint node in the rentry guide's wan i2v workflow? I was told yesterday it should replace the UnetLoaderGGUFDisTorchMultiGPU at the bottom left of pic related. Is this true? And if so, does that mean the UnetLoaderGGUFDisTorchMultiGPU parameters are not important?
Also what about the clip and vae pins from Load Checkpoint? Where should they connect in this workflow?
>>105876890wait, why do you need to replace it?
>>105876898I want to load a custom checkpoint downloaded from civitai.
The rentry guide doesn't cover using custom checkpoints in the wan workflow and I don't know how to integrate it.
>>105876905i have no idea what you're talking about but i think you need this node
>>105876898>wait, why do you need to replace it?Also I am assuming that's what this anon was insinuating:
>>105871381>>105876912I want to know where to put this: https://civitai.com/models/1626197?modelVersionId=1852433
I'd like to know exactly what node to use and where it should be connected in the wan comfy workflow provided in the rentry guide.
would be a shame if there were bashbunni pics heh.. a real shame..
>>105876931replace unetloader with load diffusion model and set weight_dtype to fp8_e4m3fn. that's it
>yet another "general" where baker deletes all posts he doesnt like
you ever accidentally generated something straight out of hell?
like you typed a normal seeming prompt but then it gives you something truly demonic, anyone knows what I talk about?
>>105877156low cfg will do that to you
>>105877156>ai>demonicchecks out honestly
>>105877156>gives you something truly demonic,AI is a mirror.
I was talking more in general of the world. But what has happened in the world of AI in the last 40 days? Is shit getting more restricted and payment processors still keveching?
Is ANI still around? He should start shop in Japan. They actually are doing so much to support it here.
so this is the power of aniwan...
>>105877263im downloading it right now... maybe I should save the bandwidth
i rented a 5090 and ran out of memory on WAN480p. I'm assuming that's a skill issue, shouldn't really need quants with 32GB VRAM right
>>105877261Simpletuner dev went nuts and started mass reporting literally everything. Got a bunch of Flux and Wan LoRAs banned, tried to report Chroma, and today got NSFW banned on Tensor.art by complaining to their payment processor.
>>105877291yes, fp16 is like 60gb
>>105877263Are there any good gens at all?
>>105877162>>105877280cute
>>105877297funny thing, tensor art is filled with child erotica stuff
celebrities are the least of their problems
>>105875682It's night time. business tentacles is over, Kirby tentacles time
>>105875103 (OP)how do you combine videos and pictures? what is the format call? o.o
>>105877301ah woops that makes sense. oh well, runpod seems pretty easy, got it setup and working at least.
gotta figure the details, like network storage is slower right? might just throw everything into the docker and not have to pay. i dont mind the 15 min startup.
>>105877420theres a collage script in the post you replied to
>>105877430by slower i meant generation times
lol
trying to make two people kiss (on the cheek) using kontext
it makes a collage with the two images and inserts a random man in the middle
fucking hell image stitching is broken
>>105877156yeah when I first installed comfy ui and tried to do porn i got 4 images of what seemed like someone peering at me through the screen with each image getting closer and the eye bigger. I kinda wish i saved the images, they were truly cursed. and didn't match my prompt at all. But after that the AI gave me everything i ever wanted.
WHAT COULD POSSIBLY GO WRONG
I think my new meds (...) are killing my libido, which in turn is killing my desire to proompt
I found freedom but I don't want it
>>105877478I take 470 mg of meds daily and don't get random boners unless I'm masturbating
maybe this is why I don't like doing 1 girl anymore because I can't be arsed lol
>>105877576der untermensh
>>105877576M-MUH WAiFU!!!
>>105877369I don't understand how they haven't gotten shut down entirely to be honest.
an anime girl rides her bike down the street. she waves as she passes a boy who is walking in the other direction
an anime girl rides her bike down the street. she waves as she passes a boy who is walking in the other direction
>>105877749Being an anime/animation inbetweener you must be sweating bullets at this point, your job 110% guaranteed to disappear within the next two years.
>>105877761I'm convinced it's impossible to get a good gen with this shit checkpoint.
>>105877772Thoughts on Chroma?
>>105877263>>105877417>>105877533>>105877761is this an official Wan checkpoint or someone made an anime version of it?
>>105877901are you testing it yourself? it's wan so failgens are to be expected
>>105877942https://civitai.com/models/1626197?modelVersionId=1840561
>>105877925it's the best furry porn generator ever
It seems that adding motion blur effect to an initial image for Wan changes video dynamics most of the times, increasing speed or amplitude.
>>105877974>are you testing it yourself? it's wan so failgens are to be expectedNot yet but I will.
I've gotten several good gens from hailuoai. How bad can wan be by comparison?
>>105878059>hailuoaiBeahahagahahah
https://civitai.com/models/1732367/ghibli-style-flux-kontext
although you can do a billion styles this is pretty good for the ghibli style
the man in the blue shirt is holding a white sign saying "BIG GUY". make the image ghibli style.
>>105878241this looks about the same as getting I'll to do img2img over the screenshot. I don't appreciate boring signs anymore. They were funny for the first day only. same with the guns. make OC for once you fucking npc
>>105878241one more example, picked a random anri photo
make the image ghibli style.
>>105878261it's just an example. you can literally generate a girl getting fucked by Cthulhu if you really want.
>>105877925I like it. Just finished a quick Chroma lora test, cailee spaeny, 20 images but for some reason the bucketing decided to drop 1 so it ended up training 19, rank 16, 512x512 training resolution, 100 epochs.
Need to experiment with higher rank and lr (this was 1e-4), also more images, at least 30. That said I'm impressed with what Chroma delivers with only training at 512 resolution, also really do the grainy photo look and the overall realistic skin.
The man in the blue shirt is standing in front of a department store that sells business suits. The sign above the store says "BIG GUYS 4 U". The man is wearing the same clothes as the image. The man has his arms out and is smiling. make the image ghibli style.
see what makes kontext neat is the transformational aspect, it's good at swaps and character manipulation.
>>105878296and without the lora to do a realistic gen like the original image which is more suitable imo:
What is flux and how is it different from stablediffusion?
>>105878313it's a more "advanced" model that understands natural language, not just tags, but it also has steeper hardware requirements and is heavily censored.
>>105878304remove the plane behind the man in the blue shirt and replace it with a large yellow rubber duck.
genuinely impressive how it can see "layers" during generation, with img2img and inpainting you cant really do this with high denoise even. very cool tool. inpainting is very useful but this is another very neat tool for edits/reposing stuff/changing stuff.
The pink hair anime girl is on the cover of an anime magazine with the title "UMA", on a bookshelf in Akihabara, Tokyo. The headline "best uma ever, Haru Urara!" is at the bottom of the magazine in large, playful text. The anime girl is smiling.
The font even emulated the text style from the screenshot. And the character was on a screen with lots of text and it was depicted just fine.
kek, this time I got a magazine but with the screenshot
>>105875243Not everything works together, but Flux isn't limited in what Sampler/Scheduler it uses. For example, I've been using Res Multistep with Bong Tangent for quite awhile now.
>>105875243>>105878513Anon posted a mega grid of samplers/schedulers with Flux awhile ago I can't find it
>>105878278Hear me out. So I think training 1024 is a mistake if you want to avoid censorship. Reason for that is modelmakers censored their model at 1024. 1024 training is the reason for fake skin. This is why Chroma is so good, since it hasn't been trained 512x512.
>>105878594>hasn't be trained 512x5121024x1024*
How do you create video loras? Do you use images to train them?
>>105878628funnily enough, you use videos
>>105878397Notice it enhanced the anime style here too. This is way better than what base Flux gives you, and more in line with what Chroma can do with pure txt2img. A shame we're only getting a censored and watered down version of this beast of a model.
>>105878594>This is why Chroma is so good, since it hasn't been trained 512x512.You mean because it has been trained at 512x512, right ?
And yes it has, while the model I used for inference was the detail calibrated which is a merge with the 1024 res fork IIRC, the model I trained against was chroma-unlocked-v43.safetensors which I believe is 100% 512 trained (the last two epochs are planned to be bumped to 1024).
The 1024 fork has only been going for a couple of epochs, the 512 main branch has been going for 43 epochs.
That said, I'm unconvinced 1024 training is somehow responsible for fake/plastic skin, I think it all depends on the training images. Flux dev was trained on synthetic output from Flux Pro, that means it will look less realistic. Still, I am impressed at how much detail Chroma retains from training at mere 512 resolution, this speeds testing up SO MUCH.
>>105878632Any good text or video guide? This rentry was last edited in August 2023 https://rentry.org/59xed3 and doesn't seem to talk about video loras.
>>105878635neat thing about flux/kontext, and wan too, is it can take a blurry image and make it HD. I tried wan with a crappy Haruhi image and it made it nice and sharp while in motion.
>>105878652new to DiT \ flow models?
>>105878655not really, it's functionally no different to training on images
there's some meta stuff to be aware of to minimise wasting time on shit that can't or won't work, but you'll figure that out
>>105876487>ComfyUI_LLM_SDXL_Adapterinteresting. adapter models still aren't out tho
Why doesn't Wan2GP allow you to import custom models/checkpoints? Why do they insist on locking you into what's available through the interface?
>>105878652>I'm unconvinced 1024 training is somehow responsible for fake/plastic skin, I think it all depends on the training images.I feel like certain models have quirks, and this censorship is noticeable at certain resolutions. Idk, I hope I am wrong and Chroma does well at 1024 training near its end.
Makoto and I are admiring guns she can find in Tarkov. (Most look like shit...)
file
md5: 5b6b763162f110f73df81e47f7e7daf5
๐
fucking hate moralfags
we need to stop depending on centralized niggers and start seeding
>>105870750here bro, duno if it helps but just adding it without any prompt is ok
u can use other nsfw loras too
https://litter.catbox.moe/i6d5rgvaqhzkkkm7.safetensors - v0.1.0
https://litter.catbox.moe/3ij4louxccghybfy.safetensors - v0.1.5
i dont understand the prompt either
someone should upload it somewhere, my hf repo got taken down and im too lazy to upload again
>>105878816maybe 20 years ago, doesn't look out of the ordinary today
First time I ever got a clean 4x4 grid
Would it be possible to use AI to generate scenes in a certain old 3D comic style?
I have this old as fuck 3D comic that just ended without a real ending. It just got dropped. I'd like to pick it up where the OG artist left it. I already upscaled the work but what could I use to generate more?
>>105878958Got that 90's VGA pc game look
Is video gen bad at handling declothing characters?
It always made me sad how few anime show female characters removing their clothes, probably because it's time-consuming to animate.
interesting test: replace the dress of the anime girl with the flower pattern dress. keep the anime girl's pose and expression the same.
this is with flux_kontext_clothes_remover lora: in ksampler preview you can see it changing the dress section, to the pattern.
>>105878958>>105878999I am once again asking for your support.
How are you making these?
>>105879307now without the lora, it fails. why? the model thinks you are trying to make lewds, so in this case it fails. but the clothes remover works for the dress. even though it's clothed, it's going nude -> clothes + texture.
>>105879323that is interesting
Can the 2character workflow be used with two loras for each character?
>>105879323and re-enabled, it once again works.
so if you prompt clothes to change, it will work without a lora. ie "anime girl wearing a tshirt". BUT if you say to replace clothes with (dress/shirt/etc) from a reference, I guess the model thinks "trying to lewd, dont do it".
>>105879351this time, single image no second image reference (bypass, so no stitch), "the anime girl is wearing a white t-shirt and blue jeans. keep her black blindfold unchanged."
works fine. so just something to be mindful of if you want a character in a specific outfit from a reference: use the clothes remover lora. you'd think it was for lewds only, but not just that!
the cartoon man is sitting on a couch at home reading a magazine. The magazine has the title "CHUD weekly". On the magazine cover is a fat black woman with the headline "DEI dead!" below her. A window nearby shows a sunny beach. keep his expression the same. keep his hairstyle the same.
all from a chud face image, no body.
> he can't even stick to a single name
did daddy love you a little bit too much or why are you like this
the cartoon man is sitting on a computer and is typing. His t-shirt says "chud of the year 2025". keep his hairstyle the same. keep his expression the same.
great meme generator desu
How are kontext LoRAs trained? Is it exactly the same as regular flux training?
when will they stop using the studio ghibli filter, place your bets to win a 5090 super
>>105879346if the lora itself isn't shit, it should work
>>105877925It's definitelly gonna be one of the better models out there. Big chunk of the criticism is by promptlets butthurt they can't just paste their booru tags.
the man in the image is diving sideways in the air, firing dual silver pistols at the camera. he is wearing a black suit. the background is a highway in America, on a sunny day.
thats what people get for holding up traffic.
>>105879613The image is a hollywood style movie poster for the film "DIE HARD 3". Include the man in the image in the center. The man in the image is firing a silver pistol at the camera. he is wearing a black suit. the background is a highway in America, on a sunny day. At the bottom of the image is the text "dont get in his way." Include film credits at the bottom of the image to make it look like an authentic movie poster. At the top of the image is the text "this summer, he has had enough."
cinema
>anisora V2 is 65gb
https://huggingface.co/IndexTeam/Index-anisora/tree/main/14B
wtf, so this Wan anime finetune isn't usuable for anyone but 6000 blackwell chads?
>>105879540Never.
How can I receive my 5090?
>>105879674>Has the word Sora in its name>Isn't an openAI productScam
shoutout to ben garrison:
>>105879701"western cowboy film in the style of spaghetti westerns" helped refine it:
>Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32
Stop shilling for this piece of damp trash now, 'cause it is pointless
>>105879674wtf I thought that pedo was supposed to save anime after doxing their coworker
guys do you think i did good
>>105879725>nowwhy? it makes gens super fast, I can make a 720p video in a couple minutes.
>>105879693Provide us with a prefferable adress for personal pickup anywhere along the Yangtze river. We'll let you know once never passes and we've verified your anwser to be correct. Disclaimer: Your social credit score must be at minimum high enough for you to be eligible for a party position. Link your WeChat account so we can send you a notification once everything is ready.
>>105877068>replace unetloader with load diffusion model and set weight_dtype to fp8_e4m3fn. that's it>replace unetloader with load diffusion modelCan you elaborate?
How does the "Load Diffusion Model" allow me to load a custom checkpoint? I can't edit either of the parameters, let alone configure "aniWan2114BFp8E4m3fn_i2v480pNew.safetensors" as an input anywhere.
Isn't "Load Diffusion Model" for gguf files?
I can't even change "unet_name" for the "Load Diffusion Model" node.
Sorry if these are all noob questions. Just trying to figure out how to get wan2.1 working right.
>>105879776Forgot to ask: would "Image Only Checkpoint Loader (img2vid model)" be the correct node to use instead? It actually has a checkpoint parameter which defaulted to the safetensors file I have in the checkpoints directory.
>>105879745if fucks up colors/saturation/contrast, you name it
It might be good for anime style, but it sucks ass big time for realistic gens
what should i type if i want to gen a 1girl with this kind of tummy?
>abs
that gives her a muscular six-pack and that's not what I'm hoping for
>>105879963(abs:0.5)
Might also try toned, just experiment with weights between 0 and 1 would be my approach.
>>105879994basically this, weights are great cause you can adjust how much of that prompt you want to apply, from a bit of definition to super bodybuilder.
>>105879903It's nowhere near as bad as you're claiming. NTA btw.
>>105879963try protruding tummy/stomach maybe
Please don't post sexo on SFW boards, it's making my semen retention more difficult.
Anons will sexo anything and everything.
>>105879740yeah, which model?
>>105880061umm
that's a man
>>105880140From Civitai,
>PornMaster-Pro ่ฒๆ
ๅคงๅธ- Illustrious & noobPlus a lora that I made from Shirogane cosplay photos
>>105879776Use load diffusion model like I said. Sorry, you have to put the safetensors file into your models/diffusion_models folder. That part isn't obvious.
By the way guys, how do you tell what folder a particular node is scanning for? It's pretty obtuse sometimes.
>>105879779no
>>105880146You shut your fuckin mouth before I slit your throat
>>105879439>On the magazine cover is a fat black woman>Miranda Cosgrovekek
>>105880059At which do you use it?
Is vace any good? Does anyone here make vid2vid gens?
>>105880177please do anzujaamu lora some day
>>105879963mercy overwatch, bikini, cute, protruding tummy, toned, blonde hair, ponytail, masterpiece, absurdres
it's not perfect but it's a lot better than my previous attempts at making loras
>>105880191>Use load diffusion model like I said. Sorry, you have to put the safetensors file into your models/diffusion_models folder. That part isn't obvious.Ah, thanks
>>105880059Maybe, it is just not compatible with Fusion t2v, or the later sucks
Can someone explain wan prompts to me? Relative to the rentry guide's setup.
For anime-style videos, do you use danbooru tags like in the image generation guide? And if so, how do you integrate them into a video prompt?
>>105879745that's great. now post a lightx2v gen that is not a girl just shuffling in place.
>>105880279>For anime-style videos, do you use danbooru tags like in the image generation guide?no, just describe what you want to see in the video with simple sentences. for example, for this gen
>>105877533the prompt was: a pretty and sexy anime girl sits across the table from the camera at a restaurant. she is eating a bowl of udon noodle soup, slurping up noodles with chopsticks
>an anime girl wears a red cap with a white "M" on it and she wears a red shirt and blue overalls and brown shoes and white gloves. she runs along the street, the camera panning and following her. a turtle enters crawling on the ground. the girl jumps and lands on the turtle
>>105880343>no, just describe what you want to see in the video with simple sentences. for example, for this genWhat if I want a very specific style? Is using loras the only option for video?
Also isn't wan pretty bad for nsfw without a good lora?
>>105880268weak should fear the strong
>>105879674>https://huggingface.co/IndexTeam/Index-anisora/tree/main/14BWAIT!
-It is a 14b model trained in 2d animations especially anime.
-Understands natural language
Questions:
1)Can it be prompted cleanly without any reference image?
For example 'An image of one girl, hatsune miku in the bed....'
If yes:
Wouldn't this be like a "Flux" but for 2d?
2) In case it can only be prompted to generate img2vid:
Wouldn't this work like a Flux Kontext but for 2d/anime?
For example: I attach the image of Hatsune Miku in bed and prompt 'Now this character is in the park dressed as a clown'.
>>105880358>>105880487Im this anon, this is the model? Can you generate one frame? (one image?)
Wouldn't this model be like an image generator that understands natural language to some extent?
>>105877478i feel your pain, my dick hardly gets hard anymore unless i intensely concentrate and look at the lewd sauce. day dreaming boners have become less of thing for me and am getting horrible insomnia and depression. The only thing that giving me hope stay for the next day is literally ai sloppa. Gaming and anime doesn't excite me the way ai slopping does. I spend more time genning eve sloppa than playing her game :'(
I'm pretty sure I got posessed yesterday. I was gooningto sloppa and I saw something like a rift with a lake of purple hands and
>>105880510Did you try roleplay through LLM? As an /aicg/ anon I can tell you that on my days off I have boners up to 8 hours non stop.
>>105880487I see what you mean, but if you look closely at the video, you'll see that the image quality is not very good. It looks like the image was not drawn with a high resolution, as if the "hires" option was not used. I think that between loading the model and trying to make it act like Kontext. I think Kontext is more practical for you.
>>105880501Not, he is using aniwan, which is on civitai and was released a few months ago, the one on HF is anisora and just come out yesterday, and doesn't have any quant yet. I don't even know if someone will care enough to do one.
it's a i2v model, and what you want in 2) is just Vace, but Vace wasn't finetuned for anime so it's shit at what you are describing
can regional prompting and kontext grouped together?
I can't get image stitching to work at all
Even though I'm uploading two images in comfy's official workflow and prompting "the man in image 1 is kissing the woman in image 2 on her cheek" neither in the output look like the input images
What am I doing wrong?
>inb4 share your workflow
I can't, personal images in the workflow
Here's the official comfy workflow
https://raw.githubusercontent.com/Comfy-Org/example_workflows/main/flux/kontext/dev/flux_1_kontext_dev_grouped.png
It seems like Veo 2 is about on par with WAN. But Veo 3 is like black magic. I wonder how long it will take for local to catch up.
>>105880830Dreamina Seedance is supposed to be better than Veo3 (at just in the benchmarks) but it is completely useless for anime and completely outclassed by hailuo02.
>>105880830>I wonder how long it will take for local to catch up.I don't believe it ever will. That's just too many Bs of parameters for too smol VRAMs of consumer grade gpus. We certainly can play smoke an mirrors to make up for some of our limitations, that's for sure.
>>105875608it's already been out for some time my bro (may 21st)
>>105879725What is everyone using in its place now?
How do you manage these seams caused by detailing? I tried to add a seg model in addition to a bbox one, but it then tends to miss the face occasionally (and cuts out the eyes often).
>>105880864https://civitai.com/models/1626197?modelVersionId=1840561
How do you use multiple loras in the ComfyUI wan workflow? The lora stacker node doesn't seem to exist in the portable comfyUI installation.
>>105881051Is there a native lora stacker? The one I use is in Comfyroll_CustomNodes
>>105881051You chain them one after another. If you're using Wan then you need model only Lora loader.
file
md5: e71eee4b951b7acc34c60a87ac07a559
๐
>>105881051https://github.com/rgthree/rgthree-comfy
You can ignore CLIP and connect only the model
>>105880892img2img at a very low denoise
>>105877068>>105880191>replace unetloader with load diffusion model and set weight_dtype to fp8_e4m3fn. that's itAm I doing something wrong here?
>>105880343lightx2v can actually work if not using it at full strength and combining it with other snakeoil like accvid/causvid, but yeah there's no such thing as lossless optimization when it comes to hyper/lightning loras.
>>105880276FusionX is a bad merge, it has these loras baked in https://huggingface.co/alibaba-pai/Wan2.1-Fun-Reward-LoRAs/tree/main and they tend to alter faces in i2v and sometimes cause gray artifacts.
>>105881256Click on the umt5 to see if a drop down menu pops up. If not, you put the clip into a wrong folder
>>105881285You were right about the missing text input, but it still says it's missing a model.
>Failed to validate prompt for output 95:>* TorchCompileModelWanVideoV2 125:> - Required input is missing: model>Output will be ignored
>>105881310Anon you're muting the lora loader node, it stops the workflow there and then. If you don't need it, you have to bypass it like the patch model node next to it (Ctrl + B)
Try bypassing torch compile. also wtf is magic_wan? Maybe bypass the lora loader too.
>>105881335You're right, I'm dumb, sorry for wasting your time.
>>105881341>wtf is magic_wannta but I was just going to ask wtf MagicWan_converted.safetensors is myself because it's in the OP workflow. I googled and found nothing.
What's the best checkpoint for architecture?
JuggernautXL struggle to generate believable buildings
Can Chroma eat a sketch controlnet?
Can anon gen an artistic masterpiece?
>>105881280The motion is good but it slopped the shit out of the image. I think it'd be good for a goon sesh though where you can just shit out bouncing titties quickly. Is that just lightx2v or also causvid or whatever?
Anyone else update comfy lately? My WAN workflow is completely fucked now. It uses too much ram and completely ignores limit in UnetLoaderGGUFDisTorchMultiGPU
When my video gen finishes, I notice python is still consuming 10gb+ ram. Is there a way to just quickly stop comfy from occupying so much ram without restarting?
>>105881766Last time I updated is when the new SLG implementation was added and it's working fine for me.
>>105881766I only update when there's some new feature I really want/need, you are essentially a perpetual beta tester with Comfy, so things will likely break.
file
md5: e190f2634e1a99d671889cfe29056159
๐
>>105881748It's a convoluted loramaxx workflow with two ksamplers that I'm still testing and adjusting. Half of those snakeoils can somewhat improve the prompt comprehension and/or animation quality, but they absolutely mutilate the result if used at full strength for the whole generation, hence why I'm adjusting the weights to avoid that.
>>105881721>when a wall shadow destroys the illusion
>>105881840There is a button "unload models" and another called "free model and node cache". These should reduce your vram footprint down to minimum.
>>105881982>Secret sauceThe fuck is that
>FaceNaturalizer>DetailerI bet those slop the shit out of the video
>>105879314This image is a digital pixel art composition featuring a collection of fantasy-themed items. The background is plain white, which contrasts sharply with the detailed, colorful objects arranged in a grid-like pattern. Lines are distinct and emphasized with sharp white or dark lines for contrast. All objects are oriented in a vertical position.
In the left from top to the middle, a large detailed {ornate|ancient|etc} magical {chainmail|plate|full suit} medieval armor with {spiked pauldrons|a family crest|a glowing runic enchantment|multiple layers of armor} and {scuffs|dents|light cracks|heavy cracks|perfect condition} is depicted. Next to it are blah blah blah is depicted.
In the center from top to the middle, a large detailed {ornate|ancient|arcane|simple} magical {steel|iron|copper|gold} {long sword|battle sword} with {simple|bejeweled} pommel, blah blah blah is depicted.
In the bottom row there are two {amulets|necklaces|medallions|pendants} blahblahblah
In the left from middle to bottom a large detailed {round|kite|tower} shield is depicted, {fresh and vibrant|dry and aged|rotten and spoiled} with {fishbone|houndstooth|broken line|tile|harlequin} patterns, with a {wooden|metallic} frame trim.
Additionally, a blahblahblah
All objects share a theme of {leaves, flowers, nature, flora, and the spring season|death, decay, rot, bones, and gore|ice, snow, stones, dwarven runes|led light strips, sci-fi cyberpunk technology|astrology, stars, and the cosmos|ghosts, nostalgia, eerieness, and the supernatural}, with occasional motifs of {angel wings|bat wings|viper tongues|beast claws|phallic symbols}.
The style of the image is classic pixelated pixel art. Shading is dithered.
>>105877925my favorite so far unironically