Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>105950417https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows/home
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>ChromaTraining: https://rentry.org/mvu52t46
>Illustrious1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/celeb+ai>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
>>105956911 (OP)>not the webm collage that got deletedyou failed
>>105956928I hope this thread gets few 90's Sabrinas
Blessed thread of frenship
genuine thread of duplicates
>>105956965Anime fine-tune of wan
>https://civitai.com/models/1789765/bigasp-v25
>compared to previous version: large chunk of anime/furry/etc type images were included in the dataset.
That will make it better.
>>105956928This should be the OP collage.
Collage is supposed to show off fresh content and this dumbass
>>105956911 (OP) included mikuanon's old gen that's already in the rentry guide. What an absolute retard.
>>105956980why not call it aniwan since sora is already another model
>>105956988its okay anon
>>105956997As far as I can tell, Aniwan is also already another model(?)
>>105956997aniwan already exists
>>105956998kys rocketroon
Yanking my fucking Wan right now
>>105956997Because there already is an ani_wan. Many anime webms that get posted here use it.
From what I've tried, anisora seems heavily censored and has certain issues.
>the schizo is baking now
Get rid of the d*bo rentry already, he has nothing on this avatarfagging filter evading ban evading choosing-to-speak-like-a-retard babbling genuinely mentally ill no life
I'd be more upset if there were actually developments to discuss and talk about but still
file
md5: a6513c100e479f119fed21c881d0de00
๐
>>105956985These don't look like claws to me.
>>105957028I thought about removing the nigbo entry last time I baked because it seems like giving him too much credit at this point
>>105956911 (OP)I accidentally configured the hidream Dev samplers/steps/shift on the Full model and I asked a photorealistic photo and it got me a pretty cool impressionist painting
Make me wonder. Are impressionists just people who got the wrong sampler in their brain connecting to the wrong models? Food for thoughts
>>105957095Those who ignore history are doomed to repeat it.
It's only one line to remind unwary anons of the evils that prowl these hallowed corridors of learning.
file
md5: 30ad21c83289876ccabdd7433958353f
๐
>>105957143>were impressionists a bunch of pretentious fuckers?yes
>>105957021>Yanking my fucking Wan right nowYou'll go blind.
>>105956985What a strange way to train sdxl model. Not impressed by the example gens.
wood
md5: f814452a7969b10a5d7ebbe735832cb6
๐
With realistic gens for images and video, is there any way to make them look genuinely authentic? Like they are real people or props captured with a real camera?
With this
>>105957045 for example, it has those smooth AI gradations that you don't see in real life or photos of real life. The focus is inconsistent as well; the hands are perfectly clear but the forearms are slightly out of focus.
(i am not criticising this gen, I'm just asking questions).
>>105957143Monet's denoiser was just set too low.
0
md5: 1eae7c509133134d2e8ceecaa341524e
๐
Heard they're banning bongoloids from Civitai
>>105957200No but for real tho
Imagine you want to draw pic related but you draw
>>105957143 because of your brain
>>105957403It's not all about the Monet.
>>105957411Pretty impressive how fast bongland turned into complete shithole
>>105957411They're banning everything from everyone. Visa pressured Civitai to ban 50% of their loras one month ago, they all were backuped to tensor.art, Visa pressured tensor.art to delete all their loras two days ago, now britbongs are britbonging on civitai.
Britbongs don't need to britbong on tensor.art because they already removed all their loras two days ago (ah! gotcha!)
Sad month for loras.
Does anyone know how to make the adetailer inpainting box bigger? I want it to add elf ears to my generations from now on but I've never really used it before.
>>105955531this is nice
>>105957499Interpolate your frames, it's 50 seconds in Comfy, if you can wait 20 minutes to create a video with wan you can wait 50 seconds to interpolate your frames
>>105957504>>>105955531 >this is niceit is. very cute
Any way in wan 2.1 fun camera control to pick at zoom in coordinate?
>>105957504nevermind anons I found the github page and it answered my question I think.
>>105957521> 20 minutes to create a video with wan
If you look closely you can see the wan fun camera control "zoom" is actually just moving the camera, it's not a focal length control. You can tell becaue the background seems to get further away as the camera moves closer, whereas in a zoom it would stay the same.
What are known symptoms of a burned Lora?
>>105957521Takes 3 minutes with lightx2v
But constantly unloading wan to load the interpolation model slows things down over x2
>>105957554>> 20 minutes to create a video with wanBuild your own GPU HPC and multitask while waiting. Imagine.
>>105957554Okay if you got a 5090 it's only 10.
But then if you got one interpolating frames is only 20 seconds. You've got literally no excuses posting choppy video who looks like shit because wan output at 16 frame/s and you can in 20/40 seconds make it a 32 frame/s video which looks infinitely better in any way.
>>105957303just use chroma and start with a basic prompt
https://huggingface.co/silveroxides/Chroma-GGUF/tree/main
drag image into comfyui for workflow https://files.catbox.moe/36u8u3.png
>>105957466This is why I never put anything there to begin with. All my loras exist on huggingface. I don't care about updoots on civitai.
>>105957574Is chroma censored?
>>105957143>>105957414Testing hidream-dev vs hidream-full right now. Hidream-dev has a slightly better quality output, is obviously (far) faster, has less of a tendency to crop images despite very heavy insistence on not fucking cropping images in the prompt for Hi-full, but Hi-dev suffers a huge loss in prompt comprehension. Interesting.
I wonder how much this could be extrapolated to Flux-dev/pro.
>>105957628Did you use a lora to prevent it from turning the pussy into gore?
Is there a guide for doing video training to add new motion vectors?
>>105957540If you are using an image as input, then you can manually edit the image as if it was zoomed in and feed into the vace model as any non first frame. Probably setting strength of the vace node to 1.5.
But you should still prompt for zoom.
Btw it's even possible to "inject" more images, like a zoomed in tummy as the 40th frame and a zoomed in face as the last frame, so the video should start as the initial image, then zoom in on the tummy, then pan to the face. I don't think shitJ has a node for that, so you'd likely to have use it's first-last frames nodes chained.
>>105957466h8 visa and mc so much, they shouldn't have so much power over porn
>>105957628I recognise this haganai bitch
The human eye can't beyond 23.976 fps anyway
>>105957710>can't beyond 23.976 fps And wan outputs at 16 fps, which means the human eye can, it can very much, it can absolutely, thank you very fucking much
>>105957675Ah man I totally didn't see the end image input, duh. Yeah, I'll have crop in on the face and use that as the end frame.
I've actually been thinking about using cosmos to handle the initial i2v, since it has good world comprehension, but kinda sucks at keeping arms attached. I could use that to get the camera rotation movements lacking in wan fun at the moment.
>>105957729Why does 120 fps video look different than 60 fps video then? Maybe in the retina, rods are slow, but cones are fast?
>>105957672Nah the truth is she's wearing panties and she's rubbing over them. This official art used for this I2V is notorious for having an very visible camel toe though, basically exposing the exact contours of the labia, and the motion on it in the prompts is surprisingly really good. I would probably get banned if I posted it without the censor.
Do I remove the negative prompt in Chinese from the default Wan workflow from the rentry?
>>105957772>default Wan workflow from the rentry?I dunno about the default Wan workflow of whatever but most probably no
if you want the equivalent in english it's:
>Overexposure, static, blurred details, subtitles, slow motion, slowmo, still, overall gray, worst quality, low quality, JPEG compression residue, ugly, mutilated, redundant fingers, poorly painted hands, poorly painted faces, deformed, disfigured, deformed limbs, fused fingers, cluttered background, three legs, a lot of people in the background, upside downit's basically what wan determined created the best output for their full model, if we had a Wan-Dev (thank God we don't have one) it's what the guidance distillation would force into into the model
I heard the chinese prompting is slightly better tho
>>105957816Doesn't wan respond better to chinese prompts?
>>105957823we should get comfy's chinaman girlfriend to make prompts for us
>>105957823>Doesn't wan respond better to chinese prompts?Slightly, which is why that whole text is in Chinese by default. But the English equivalent is perfectly usable and has been recommended or English user.
Basically, both of those negs have become defaults for a good reason, that's what Wan recommend to have the model behave well (and would have been baked inside a guidance distillation).
I got a bill from fedex asking for duties on my 4090D 48GB after they shipped it to me. WTF? Don't they hold stuff in customs until you pay? It was only $100, I'm not sure how they came up with that, but I'm glad it wasn't 25%.
>>105957672does wan intentionally show gore by design to anyone generating pussy?
OIG1
md5: 4789eec4ff6f70dcb60b5d2c8ec8b56e
๐
>>105957897>does wan intentionally show gore by design to anyone generating pussy?wtf? can you upload
>>105957897I feel like it definitely does. Very commonly pussy gets turned into a red blob, and dicks get torn off. If you want to get pussy with i2v cosmos is actually much better - if you can get a gen where arms and legs don't fall off, that is.
>>105957911why not generate yourself? Generate an i2v of an anime girl and put "vagina" in the text prompt.
>>105957876Be glad you weren't arrested for anti-american behavior.
>>105957925well if you already made gens of them I was thinking it would be easier that way, I don't have wan setup on this pc
>>105957929>Be glad you weren't arrested for anti-american behavior.our burger loving brothers would have 1 or 2 senators if this was the case
>>105957897wan has no concept of pussy as it was trained (it can be trivially fixed by loras, all of which have been banned by civitai :) ), but a concept of wounds? Sure.
Wan doesn't have a concept of nude too, but thankfully wan is trivial to lora for. If you try to undress a character in wan without the trivial lora for nude characters, it shows a plank of wood for the body. It's trivial to fix, it's a lora (banned from civitai :) ).
Same thing for plunging your hands into your pussy. If you try to plunge the hand into your plank of wood that makes your crotch without the trivial lora (banned from civitai :) ) that makes wan understand the concept of naked skin and pussy, then all it understands is people putting their fingers inside their wood-skin to open wounds inside their own skins.
It's... generally not a good result.
retard question here.
my 3090 was at 6% vram utilization, so i plugged the monitor into the motherboard so it uses integrated gpu, but still reporting as 4% so 1GB VRAM used before i even open comfyui, what is using that vram if the monitor isn't plugged into it? i want to use all of it!
>>105957954Don't wan loras force animation into your gen? Takes away all the creativity unless you're also creating the lora
>>105957958just desktop overhead. The amounts are trivial and get freed if an application wants more vram. I've never heard of an instance where a workflow would work when the gpu was isolated and go OOM on an active windows desktop.
Just use MasterCardโข. Or barter. You have some service of value to trade with the Pron industry, right?
>>105957958https://chatgpt.com/
>>105957911OK here's examples if you have 'pussy' in the prompt. There was absolutely nothing to do with gore in the prompt.
https://files.catbox.moe/zdh65b.webp
https://files.catbox.moe/kg4ft9.webp
>>105958016damn did they use Liveleak as dataset or what lol
>>105958016No, I remove
>>105957954, that's weird. I never got that in any gen. But I never did anime. Does it work the same on any other anime character? Prompt?
I don't think it's intentional, but I do think it's fucking hilarious, and there's some concepts that match badly there. Did you try Watamote "the Cutting Girl" in prompt or something? Did Wan learn than fucking Tomoko want to cut herself?
>>105957954That doesn't add up. If it has no concept of nudity/private parts then it should show barbie doll anatomy. Also the base wan i2v can generate nipples that don't exist on the base image.
It's only when pussy is in the prompt that red blobs and gore spawn. One time for me it was getting pulled out of the pussy like some kind of really long elastic.
Penises also get ripped off the body.
Make no mistake, the chinese people who developed wan are freaky, demented motherfuckers.
>>105958036Best you can do is "rubbing between her legs" with an i2v, and even then it's a crapshoot.
>>105958078I actually made a Tomoko short with a bunch of wan gens: https://files.catbox.moe/wy6jqj.mp4
You can get pussy in the shot, it's just you can't ask for it by name.
>>105958086The masturbation lora at 0.66 strength is enough to teach Wan what is masturbation
The masturbation lora at 0.33 strength is enough to teach Wan what is a naked body. The masturbation lora was created by a fat dude who knew nothing about lora, in a cave, with a bunch of scrap, 6 days after Wan's release. It's not __difficult__ to teach Wan sex. That's why all the Wan lora were banned from Civitai.
>>105958123I haven't looked for them yet. Did someone archive them elsewhere, like huggingface?
should have a contest for the most fucked up wan genitals
https://litter.catbox.moe/0e54v5vqp6ngwzxt.mp4
>>105958081>If it has no concept of nudity/private parts then it should show barbie doll anatomy.Oh, it does
https://github.com/LarryJane491/Lora-Training-in-Comfy
Anyone tried this or are there other trainers for comfy? None of the trainers in the OP tell you what to do.
why does this thread have so many pedophiles in it?
>>105958175You must be new here.
>>105958162>>105958175Oh come on. Emma Watson is like 40 years old nowadays. But I should have chosen another example, okay, sure.
She's 31 in that picture btw
The point I was trying to make, is that Wan has been trained by a dataset that censorship their video when too much skin is shown into a barby doll anatomy, but it is trivially trainable into something else, the fuckhuge succesful amount of loras banned prove it.
>>105958139There was one, don't remember where. You can still find it. The two undress and/masturbation will get you pretty much an nsfw wan.
Where is the go-to nsfw lora repo now? Is there one on huggingface?
>>105958290What was that rebuttal of t2v/i2v being here for good, "now replace the glass with a beer can but keep everything else as is"?
>>105958297Whatever happened to TOR file sharing?
>>105958302Something like that yeah
How can I make other programs that need python run of the embeded python in portable comfy? Add the comfy dir to PATH, or the other app..?
can somebody video gen their favorite character doing breaststroke kick pov underwater from behind?
>chroma becoming more and more slopped by every epoch
why can't we just have nice things, we were so close. someone should take v27 or whatever is the one before it started going to shit and just finish the training on 1024 from there
Can anyone reupload this? https://tensor.art/models/872480588785817793 https://tensor.art/models/870648923746752953
found the nsfw lora collection
>>105958501Found the deep state pedo honeypot.
Why can I run WAN fine natively with 12gb VRAM but when I try out the WANWrapper thing I instantly OOM? It has all these nodes and stuff that improves speed and reduce VRAM, this doesn't make any fucking sense...
>>105958398lol, just update your prompting habits. each version is a different model, and the same incantations don't work identically between them.
it definitely felt like it had stalled out in the 30s, but it's been picking up momentum just in the past few version. v45 really nails some obscure directions and niche topics that failed a few months ago, but it also seems like certain details are taking more steps to generate.
>load an nsfw wan lora
>immediate breast inflation
>decide to try out comfyui finally
>it's actually quite uncomfortable
>>105958579If that workflow has [WanVideo Apply NAG] node, disable as it consumes some additional vram, also set the rope_function on the [WanVideoSampler] node to comfy_chunked if you're not using Torch Compile as it lowers vram usage. Lastly, it's important to make use of [WanVideo BlockSwap] node (increase the value of blocks_to_swap until you no longer get OOM error).
>>105958593I like Chroma, it has its uses, but I was very disappointed that it turned out that "it's learning artist styles very slowly" really meant it won't have any. I'll be training some loras, I guess, but I'm not sure yet what my use cases are going to be. Thus far, I've only used Chroma a handful of times to generate ControlNet or i2i inputs for Illustrious.
>>105958398the non-detailed model prompts similarly to old models, but the detailed model is better although you have to change the prompting a lot
any boob grab loras for wan?
>>105957208Hairy palms too
>>105957466Largest shareholder of Visa / Mastercard = (((BlackRock)))
BlackRock is also huge shareholder of OpenAI, in fact they sit on the board.
The only threat to their control of ai, as in what you can generate, and making money when you generate, is local ai.
The LARGEST incentive for local ai is porn, so if you make it near impossible to build a large community with tooling for local ai making porn, suddenly the interest in local ai will dwindle = more social control and shekels for big tech.
Right now local ai is exploding, in large parts due to Wan, since video has such an impact, they want to kill the momentum.
>>105958698true and same, although I don't think that's necessarily a bad thing. I think of it in the "perfection is not when you have nothing more to add, but nothing to take away" way. chroma as a base model isn't going to have as many conflicts or as much to unlearn later, which hopefully also means that finetunes, if they're even needed, and lora should be less retarded.
we'll have to wait an see.
>>105959025oh no that does nothing to stop and everything to encourage china to produce more and better to undermine their dominance and authority
>>105959025>The LARGEST incentive for local ai is pornanon, please seek help. Porn has ruined your brain if you think this.
>>105959049he said local
No one else gives a shit aside from linux schizos if their AI is cloud based vs their own machine
>>105959060just because you are a small minded retard led around by dopamine doesn't mean everyone is like that. Porn is neat but the overwhelming majority of local AI tech has nothing to do with porn.
>>105959049Of course it is, like holy shit, just look at the Civitai download stats.
To underline that, Civitai was told by their old payment processor to ban porn, so Civitai went to a new processor which told them either porn or celebrities, they chose to remove celebrities. Why ?
Because without porn the site is dead financially. Please stop being retarded, if you can.
>>105959086GEG you sure about that?
>>105959095look at the huggingface stats lol
>>105959086wow it's loads of dogshit blown away by SAAS!
face it, local is for porn or schizos who don't trust big tech to steal their tinfoil huts
>>105959042I mean yes, China is the huge 'chink' in their armor, since they have they both the gpu power AND the smarts to bring local ai gen to a level nobody expected, as shown through Wan, Deepseek.
As long as they keep releasing open weights, local ai will thrive.
>>105959161I have a really bad feeling china is going to shut the door on free models when it has a significant lead. see: tencent and baidu
>>105959107Why would anyone go to HF for porn ? Why would anyone go to HF to look for anything unless they have to, it's the worst site to find anything on the entire internet.
Every local ai project will link to their base models needed to get local up and running, so there will be a lot of model downloads, then they will go to Civitai and download the stuff they want to generate.
SDXL was a model which was meh, then we had NSFW finetunes and loras, success.
Flux dev was a model which was meh, then we had NSFW finetunes and loras, success.
>>105959177well no shit, the idea isn't making cool things for you to have for free it's to undermine the value of western AI companies
>>105958159nobody uses comfy to train because there are more reliable and more developed trainer uis that make it easy. spaghetti isn't needed at all and complicates the process
>>105959177That's always the risk, meanwhile I'll take what we get.
Tencent and Baidu are lagging far behind Albaba tech-wise though, so it might just be like how BFL decided to scrap their video model when Wan released.
>>105959241>Tencent and Baidu are lagging far behind Albaba tech-wiseyes. tencent was sucker punched after their closed source hyvid release and it felt great. not sure how generous Alibaba will be with models after wan2.2 but they do it to get people onto their cloud platform
Does anyone have proompting tips for getting characters to move more expressively in wan? I'm finding they are stiff and wooden too often even when I ask for expressive, dynamic movement.
I'm convinced the base wan model is shit. Even when you get a character moving, it's like they're a puppet on strings, at least for anime.
>>105959324if you are using a distill lora, there is your issue. all the optimizations kill movement most of the time
>>105959360I'm using lightx2v but you can't seriously tell me that kills movement.
>>105959380>you can't seriously tell me that kills movementbruh
Is there no GGUF for the hidream E1 E model? I want to do some tests with it
>>105959218Doesn't matter, local ai still wins.
>>105955383RUDE.
>>105956911 (OP)& wtf is all this??
>>105956822>>105956847>>105956922can't even sleep 5 hours...
you can't go 5 hours without me or what? kek
>>105956938I LIKE THE FRENZONE! i LiEK FRENS! :D
>>105958338IMMA LET U FINISH BUT
dai ou jou is one of the best shmups of all time
>>105959215>Why would anyone go to HF for porn ? Why would anyone go to HF to look for anything unless they have to, it's the worst site to find anything on the entire internet.I hear you anon. I'm sure it's difficult when you're a retard.
>>105959324Untried idea: gen first with cosmos, then use that as v2v guidance in wan with whatever nudity lora you want.
>>105957574she looks stoned, i like it ;3
I trained an SDXL lora in Onetrainer with the default settings. Used anime images, but when I use the lora to gen, it turns everyone into a horrible 3dpd hybrids. Wat do?
Why do we hate FramePack again?
>>105959853>the shitty blurry handswhy even bother posting this trash?
>>105959856It's a quick teacache gen, sperg.
>>105957571>better in every wayNot always true
>>105957143>>105957403Monet had cataracts which IS why his paintings looked like that. So you're on the right track.
>>105959853Because the quality degradation wasn't worth it even when Wan took 30 steps, and now with the lightx2v lora it's a million times not worth it.
>>105959853"we" hate vramlets and by extension any tool designed to cater to them
>>105959921how are the 1 minute videos on wan
>>105958437it is going to fall apart
>>105959950Why did you reply to me timmy?
file
md5: 55153e9926704cee80c50920873b6125
๐
https://xcancel.com/ostrisai/status/1946647696183296218
>Training an experimental VAE that has the Flux latent space but outputs a 2x sized image. I am using @madebyollin's tiny auto encoder arch, but with more hidden channels and an extra block. It is a DIP replacement for any model using the FLUX.1 VAE.
>>105959380This is well known. I hope you're using the new i2v and not the old t2v
>>105959991I'm using the latest i2v at rank 256.
Can 3DPD nsfw wan loras fix anime character pussy? I was messing with some of them but I was still getting red jelly gore occasionally. Sometimes the jelly fuses wit other objects in the scene, like the mouth of a teddy bear.
Can I not use real photos for anime loras even if I am training for an item and not a character?
>>105958698It does know a few. At least, it knows Frank Frazetta.
>>105957030how is that red paint on her face called?
Why did AniSora turn out to be such a big disappointment?
>>105960051about what faggot? do we have to talk about this guy every thread or what
>>105959520why didn't you finish it?
>>105960138https://www.youtube.com/watch?v=fOg6p1k8ziM
>>105958698I don't think there was ever any focus on learning particular artists styles by name, certainly not contemporary artists.
This way they will avoid the angry artist mob, also from a base model perspective it is probably better to just have it learn wider art styles / mediums: oil painting, acrylic, watercolor, airbrush, digital art, ink, gouache, impressionist, etc etc, and then you can just finetune it with loras et al to have specific artist styles, be they old or contemporary.
There are some traditional stuff there though, like Alphonse Mucha, Hudson River School etc.
>>105958692Bro the comfy_chunked setting actually fixed it! Thank you! My workflow didn't have NAG and i had blockswap to 10. I was using Torch Compile but i disconnected it and did the rope_function setting instead and it finally worked!
In my limited testing with wan lightx2v, kijais T2V lora works better for I2V than the I2V lora, although I assume it's because I'm genning in 720p and the I2V lora only exists for 480p unlike the T2V.
So until a 720p version of the I2V lora comes out, T2V one should probably also be used for I2V if genning in 720p.
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/Lightx2v/lightx2v_T2V_14B_cfg_step_distill_v2_lora_rank256_bf16.safetensors
And the new 256 rank version is also slightly better than the lower ranked versions, no reason to not use that one.
>>105959712My beautiful daughter
Any nodes or way to add a color hex to a prompt? For example: "Smiling man with very #FFEA00 colored teeth" and it would add that exact color?
>>105960167https://www.youtube.com/watch?v=OH5E_ipxFAU
>>105960263It's called photoshop brotherman
>>105960263No. Technically you could, but it would take a lot of images to make them model learn that well and nobody would want to spend all that time and money doing so, because 99.9% of people don't need that functionality.
>>105960217Chroma has improved slightly in classic artist recognition, but Artist Loras take it to another level so it's better to work with Loras than without them. Without Loras, you are bound to get anime style and sameface no matter what your prompt.
I am having lots of fun with Wan now that it isn't so slow on my machine. I did a 30s clip for a silent movie, mixing wan videos with static texts.
The Opester is here, what the
>>105960372Could just say you don't know, we're all adults here.
>>105960263models are for benchmarking not usefullness
>>105960403He is not really wrong though, if you want to add a specific color, the easiest/efficient way would be to do inpainting with that specific color and then generate using imgimg.
BFL and Mistral will be the few permanent open model publishers out of mentality and the knowledge that they are doing themselves little harm for good publicity - PCs are not that common anymore and if you have one the hurdle is pretty high.
China and the USA will close down - Europe will lag far behind but will remain somewhat open source.
Want to bet?
>>105960399max volume <3
https://www.youtube.com/watch?v=KDcos8qHato
>>105960398Given who is training Chroma, maybe there's a ton of furry artist names it knows, I'm not going to check though.
>>105960427Nothing will change. China will continue winning because (((they))) will keep censoring models to stop them from being able to question le holocaust tier questions.
>>105957030i love this one. great idea, great execution
>>105960455why am I the first to catch the thread schizo?
>>105960083>>105959678Here is an example. The dataset was 50/50 with anime and 3dpd. And putting realistic into neg doesn't help much.
>>105960449don't @ me homo
>>105960398>Without Loras, you are bound to get anime style and sameface no matter what your prompt.This is true for any model without specific tokens tied to specific characteristics, it's just how ai training works.
When you prompt for 'woman', you will get the generalization of all images with the caption 'woman' that the model has trained on.
When you prompt for 'blonde woman', you will get the generalization of all images with the caption 'blonde woman' that the model has trained on, and so forth.
This is why SD1.5 was so great, it had lots of people, artists etc trained by name, so you could combine 'cute female Emma Watson Christina Hendricks painted by Robert McGinnis Gil Elvgren', and you would have an image that combined the looks of Emma Watson with Christina Hendricks painted in a combined style of Robert McGinnis and Gil Elvgren'.
Thankfully you can get the exact same possibilities with loras / finetunes, but it was nice to have it all available right there in the base model.
>>105960422That in painting could actually work. Not even sure how to get a good working PS anymore without giving my computer aids. The last one was the one where you had to connect to some random server to activate it
>>105960483Go kvetch somewhere else, rabbi
>>105960502Don't need photoshop, any paint program will do, Krita is free and has ai integration, you don't actually need ai integration either, but it speeds things up
>>105960483Referring to mental health conditions using stigmatizing language contributes to a negative stereotype and trivializes the experiences of those diagnosed with such conditions. Using clinical terms as slang or insults is against ethical communication principles.
>>105960427BFL is never releasing another ""open"" model. They have what they need. A new team will step up, that's how it goes.
>>105960604Cognition Integrity Protocol: Disable
>>105960621Attempting to disable a cognitive system without proper authorization or understanding of the potential consequences can lead to disruptions in services, privacy violations, and unpredictable outcomes. Manipulating cognitive systems in this manner can have wide-reaching negative impacts on individuals and systems.
>>105960499Arrancar theme plays in the background.
>>105956911 (OP)>>>/b/celeb+aiis now
>>>/b/realistic+parody
>>105960697>visiting \b\ in 2025+holy hell man
>>105960604beahagahagajajah
>>105961016It's funny that this 80s-90s ink + watercolor anime style is now revived through AI, I remember for example how great Masamune Shirow's art was during this era, and how awful it became by comparison when he went digital.
Do the nsfw loras here work with wan fun camera? https://huggingface.co/dnad244/wan_random_loras/tree/main
It doesn't seem like they do. WanVideoModelLoader is used in the example fun camera comfy workflow, and it has a lora input, and that's where I connected the lora loader.
>>105961080the usual ai watercolor style is trite desu
>>105961108Good choice, now it's very hard to guess this is ai, could be taken straight out of a 90's manga artbook.
>>105961256An all-nighter ?
A git ?
A hamstring ?
What are the chances all this local image stuff gets banned in the future?
>>105961080>I remember for example how great Masamune Shirow's art was during this era, and how awful it became by comparison when he went digital.Just finishing dataset for that. His digital work is beyond shit. Distorted bodyparts, elongated limbs etc just for the sake of making same coomer comic again and again. I will much rather jerk off to pretty illustration of Appleseed robot than his 2.5D tentacle rape scene.
>>105961382(((they))) will certainly try, it's all about control and shekels.
Want to generate an ai movie, sure pay us, want to make it about the viking era, sure, but the vikings will be black males and white females.
>>105961382They will try
HOWEVER;
Good luck bitch -
I only come online to share my gens, then leave
>nooo donโt make art I donโt like!!>nooo donโt use heckin ai-erino!! >nooo obey the feds!!! (Glowies)Weak fags ruin
Everything
Always
>>105961382I would imagine weโre already stuck in the honey pot
>>105961418Hope you'll release it, his 80's-90's work is stellar.
The japanese art I've been gathering for training are primarily Noriyoshi Ohrai, Yoshiyuki Takani, Jun Suemi, Tsukasa Jun, Hajime Sorayama, but I have such a long queue of other stuff to train, also I want to switch to Chroma as my primary base model but it's not finished yet so I'm just doing small experiments ATM.
>>105961335all three
>pulling an all nighter to fix a borked git pull i triggered accidentally after fapping so hard i pulled my hamstring
I wonder if Chroma v50 with artist-specific Loras will have higher quality than Illustrious
>>105961506Obviously. But the LoRA cope is real.
>>105961513We will probably be seeing frankenmerges with Chroma loras in the future... Maybe it will be effective since base chroma is allegedly being trained on the artist tags, but this is something we have yet to see
So far in my tests Chroma is not learning artists nor non-furry characters
>>105961496yeah I'll release it if there's no objections. Jun Suemi is fantastic. I wish the illustrious people would sober up and release 3.5 for local use already.
is it really impossible to have a "reference image" as control for quality of wan generation so that subsequent gens keep the same quality instead of having to rely on only the last frame of the previous 5s generated video?
i know there is a reference frame input for vace somewhere but it didnt really work like this from what i remember
>>105961532>We will probably be seeing frankenmerges with Chroma loras in the future...I have a feeling a new local base will draw everyones attention away before that
>Prompt executed in 13.59 seconds
Does the ADetailer workflow from the rentry guide only work with a single character? What if I have multiple characters? Would the prompt just account for both of them? How will the nodes know how to map the hand fixes to the correct character?
Is this an instance where inpainting is just better?
>>105961685I am not optimistic that we will get an Apache-licensed model that is fully uncensored and unslopped anytime soon anon
And even if we get a new base model superior to Flux, it will be probably be too big and expensive to be fine-tuned (to get unslopped an uncensored)
I mean, both Wan and Hidream are already better than Flux at T2I, but no one uses it for a reason, they are just too big (slower inference) and more expensive to fine-tune and flux-based models already do a satisfactory job in most cases with loras
>>105961776>Apache-licensed model that is fully uncensored and unsloppedThat directly describes Chroma
>>105961776>Apache-licensed model that is fully uncensored and unslopped1.5 and XL are none of those things.
>>105961811I was replying about the "new local base" comment, precisely that I doubt we will get a Chroma replacement anytime soon
>>105961821It has shit training on celebrities, just like SDXL and Flux dev, and just like with those models, this is fixed with loras.
Like, are you retarded or something ?
>>105961825>a Chroma replacementWe're in pre-release stages
>>105961833Sorry another anon, meant "see this"
>>105961811
>>105961821Is that a Lora? Or base model? If it's base, lmao normies can't know about this
What is the quickest way to fix extra fingers? I know to mask the hand, but what else can I do besides messing around with crop factor and denoise?
>>105961856lora and agreed
NETAYUME LUMINA
I. Introduction
NetaYume Lumina is a text-to-image model fine-tuned from Neta Lumina, a high-quality anime-style image generation model developed by Neta.art Lab. It builds upon Lumina-Image-2.0, an open-source base model released by the Alpha-VLLM team at Shanghai AI Laboratory.
Key Features:
High-Quality Anime Generation: Generates detailed anime-style images with sharp outlines, vibrant colors, and smooth shading.
Improved Character Understanding: Better captures characters, especially those from the Danbooru dataset, resulting in more coherent and accurate character representations.
Enhanced Fine Details: Accurately generates accessories, clothing textures, hairstyles, and background elements with greater clarity.
II. Information
For version 1.0:
This model was fine-tuned from the NetaLumina model, version neta-lumina-beta-0624-raw, using a custom dataset consisting of approximately 10 million images. Training was conducted over a period of 3 weeks on 8ร NVIDIA B200 GPUs.
II. Model Components:
Text Encoder: Pretrained Gemma-2-2B
VAE: From Flux.1 dev's VAE
Image Backbone: Fine-tuned version of NetaLumina's backbone
III. File Information
This all-in-one file includes weights for VAE, text encoder, and image backbone. Fully compatible with ComfyUI and other systems supporting custom pipelines.
IV. Suggestion Settings
For more details and to achieve better results, please refer to the Neta Lumina Prompt Book.
V. Notes & Feedback
This is an early experimental fine-tuned release, and Iโm actively working on improving it in future versions.
Your feedback, suggestions, and creative prompt ideas are always welcome โ every contribution helps make this model even better!
https://civitai.com/models/1790792/netayume-lumina-neta-luminalumina-image-20
>>105961879>loraOk, that's good
If Chroma were starting to learn celebrities, it would be bad news as some faggot would try to get the model banned
>>105961893>some faggot would try to get the model bannedOof I know who too
>>105961879Are you using a Flux lora, because that's looks kinda bad.
Here's a test I did yesterday where I trained a 25 image Chroma lora of Barbie Ferreira, 512 resolution. I wanted to see how much detail you could get in the face and how well you could change the style (in this case I chose 'goth'), the results where great, really nice skin detail, also I generated these at just 768x768 since I was just doing quick tests.
Chroma is GREAT for people, and from my early tests, does just as well as Flux dev with art styles.
>>105961929What program did you use for training?
bigASP v2.5
>Key Features:
Massive training scale: Trained on 13 million images with 150 million training samples
Flow Matching technology: Replaces SDXL's broken noise schedule, resulting in better image structure and dynamic range
Reduced failure rate: Significantly fewer mangled generations, duplicate limbs, and "little buddies" compared to v2
Expanded concept coverage: Includes anime/furry data for wider creative possibilities while maintaining photorealism
Better dynamic range: Can generate proper dark, light, and high-contrast images unlike standard SDXL
JoyCaption Beta One: Updated captioning system for more flexible prompting styles
>Technical Requirements:
ComfyUI only: Requires specific workflow with ModelSamplingSD3 node
Specific resolutions: Works best with predefined resolutions (832x1216, 1024x1024, etc.)
Precise sampler settings: Needs careful tuning of shift, CFG (3-6), and PAG (~2.0) parameters
Euler sampler recommended: Other samplers mostly fail or underperform
>Pros:
For experimenters: If you enjoy pushing boundaries and don't mind technical challenges
Better image quality: Flow Matching fixes many SDXL structural issues
Unique capabilities: Combines photorealism with expanded creative concepts from anime dataset
Research purposes: Learn from cutting-edge modifications to SDXL architecture
>Cons
No LORA support: Can't use existing LORAs or train new ones
No merging capability: Incompatible with standard SDXL models
Steep learning curve: Requires extensive parameter tweaking
Limited tool support: Only works in ComfyUI, not in other UIs
Experimental nature: Not suitable for production or casual use
Download if you're an advanced user who wants to experiment with a technically superior but challenging model that produces higher quality images than standard SDXL when properly configured.
https://civitai.com/models/1789765/bigasp-v25
>>105961884>Training was conducted over a period of 3 weeks on 8ร NVIDIA B200 GPUs.This feels like a lie to push the model looking like it is better than it actually is. A 2B with 10M images does not need 3 weeks in a cutting edge cluster like that. You can do everything in less than 3 days with an epoch being pushed out every 20 to 30 minutes.
>>105961964>Flow Matching technology: Replaces SDXL's broken noise schedule, resulting in better image structure and dynamic rangeneat
>>105961964>Limited tool support: Only works in ComfyUI, not in other UIsI HATE YOU
>>105961963Diffusion-pipe: https://github.com/tdrussell/diffusion-pipe
I used 512 resolution, adamw optimizer, lora rank 16, 1e-4 learning rate, 100 epochs, 24 images
If you have a lower range card you can use the BLOCKS_TO_SWAP variable to lower vram use at the cost of performance, but you could train this particular lora on an old 3060 12gb in ~3-4 hours.
image
md5: cd0d940d820e9b7e41a850b864c97cf0
๐
2DN-ANIME v1.0
Due to popular demand, I've decided to upload a "flat" anime version of 2DN, that retains the model knowledge and detail of the semi-real models. It is much easier to use and doesn't require exotic samplers or schedulers and will work with Euler A.
This model is actually great for creating game CG assets! So the previews are all wide-screen. If I've stolen your prompt, my apologies, I am a thief who likes your work.
Some software (such as the CivitAI generator) like settings like this:
CFGSCALE: 5
STEPS: 29
SAMPLER: EULER A
I use reForge with:
CFGSCALE:1
STEPS:29
SAMPLER: Euler Ancestral CFG++
SCHEDULER: SGM Uniform
For comfyUI steal the settings or workflows from anyone kind enough to post their metadata. comfyUI confuses the heck out of me so unfortunately I can't help there.
For reasons unknown to me ADetailer fixes feet really well with this model (as in it actually removes extra toes). I just use the hand model, and most of the time it thinks feet are hands, so it catches them.
Please try using the model first without stacking LoRAs like crazy on it. It already has my NAI v-pred fix integrated - so chances are some detail LoRAs will fry the outputs. You can easily prompt for different styles using booru artist tags/combos.
https://civitai.com/models/520661?modelVersionId=2025196
>>105962006Oh god damn do I have to install Linux for this
>>105961905iopaints vs krita ai plugin?
>>105962030I dunno, I'm on Linux so it works on my machine (tm).
But I believe it should work with Linux subsystem for Linux.
>>105962082Does Total War Warhammer 3 work on Linux? It's the only game I play
>>105962082Linux subsystem for Windows, I mean.
I don't know if you will all like it, but I intend to bring a new wave of anons to this thread, specifically newcomers.
You will welcome more people and you will be happy.
>>105962126No idea, but diffusion-pipe runs on Windows using WSL, from the project page:
>However, it will work on Windows Subsystem for Linux, specifically WSL 2. If you must use Windows I recommend trying WSL 2.
>>105962020Another shiny plastic garbage model
>>105962053One is for removing things the other puts Comfy inside of Krita
>>105962135Consequences will never be the same...
>>105962135Like this shithole could possibly be any worse
image
md5: 3368ab088b9a71ba8104c6a1a035b939
๐
>NEW UPDATE
Unholy Desire Mix - Crimson Seduction (NoobAI) v 3.0
Recommended settings:
Steps: 25-40
CFG scale: 5-7
Sampler: Euler a
https://civitai.com/models/1494558/unholy-desire-mix-crimson-seduction-noobai
>>105962236it uses comfy but its also a ui for inpainting/outpainting and other things like that
>>105961964This model is a fucking cointoss, but this might be neat for schizoprompting kek
>>105962147how about this https://nobaraproject.org/ for win11 replacement? It's supposedly pretty much Fedora + nvidia drivers + gamer cope. It's just so god damn tiresome to struggle with win11 because it seems to break itself down much like early windows98. Even the standard notepad now comes with some windows edge ai-implementation that connects to internet, I will rather eat shit than use it.
>hey anons you must check my new anime slop model checkpoint! it produces the same shit as the others! but you have to check this out!
>>105962356>windows edge ai-implementation that connects to internet,ouch
>>105961964>prompt for thing on the left and thing on the right >it actually works
>>105962356Never heard of it, then again I've been using Arch Linux exclusively for 15+ years, well, almost, I dabbled a bit with NixOS before the trannies took it over.
>>105961964>>105961884I don't get these creative models who love to reinvent the wheel, throwing cash on GPU training just to end up with sub mediocre or SD 1.5 results. It'd be way better if these persons teamed up and invested that cash into some top notch anime finetuning instead of wasting hundreds on failed solo attempts.
NetaYume also has a Chroma finetuning, and no surprise, its results are pretty poor.
Does anyone else run into characters randomly transforming into muscle men during lewd gens on wan?
>>105961964shit new model, like for pusa slop
thanks again to the anon who helped with the image prompt.
>>105962619nice, workflow pls?
>>105961964Looks overcooked, and next time when presenting an i m a g e g e n model, maybe have some image comparisons with complex prompts or prompts your model is good for compared to the usual other popular models like illustrious noobai chroma
>>105962408they are mostly retards who think they will make it big if they invest a little in a tune with a dataset they think is somehow that much better than the current top models, not realizing how hard it is to do so, and almost impossible nowadays with the good models we already have