Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>106107968https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanXhttps://github.com/Wan-Video
2.1 Guide: https://rentry.org/wan21kjguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y
>Chromahttps://huggingface.co/lodestones/Chroma1-Base/tree/main
Training: https://rentry.org/mvu52t46
>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
>>106110641 (OP)blessed thread of frenzone ;3
>>106110664i'll make a schizo-lain or whatever but i dont want to be the only one wasting electricity doing it... ;c
>>106110700still mad huh? ;3
>>106110698Skin too white
GenJam is alive don't listen to the other anon
okay, wan t2i isn't half bad
GenJam is dead don't listen to the other anon
>>106110740>t: sdg tranny with innie penis
>>106110678im submitting mine this weekend
anime, schizo, lain is literally the same theme 3 times and it should have been rerolled
its ok. the 3rd actual time you do something is the first real time you do it. that goes for anything
>>106110756You do realize the first genjam was just one theme?
What is your opinion on the anti-GenJam movement?
>>106110756>literally the same theme 3 timesThen anons submissions should be kino soul mhm
>>106110754>t. poopdick schizo
>>106110756it looks like at least 5 people voted schizo, PROPS FOR ORIGINALITY GUIS
>>106110773yeah and if it was just 1 theme instead of 3 themes that are all the same it would have pissed me off less. there's a psychology term for this i think
>>106110774my opinion matters less than the fact that no one on the entire board posted anything for 39 seconds wtf
Blessed thread of frenship
literally a single anon who routinely posts lain and i think other anons are scared to compete
>>106110347the unfiltered frontpage literally features a "my 600lbs life" woman and gay furry porn now, how is that supposed to be an improvement?
>>106110793why do ppl think genjam is a competition where you "compete"? it's just a community activity where everyone posts their gens for an album and collage. Everyone gets included.
>>106110532i aint reading that shit nigga
>>10611080511/10 i can't tell if this is bait or not so i HAVE to give you a (You)
>>106110712looks a lot like the girl who held in her shit when she spent the night after a party, and when we had sex the next morning, i saw she had the tiniest nugget of poop smashed into her butt hair.
>>106110818>11/10 i can't tell if this is bait or not so i HAVE to give you a (You)
>>106110805the only ones who feel that way are probably simply self conscious i would guess
>>106110820did that experience give you a tiny poop nugget fetish anon you can be honest with us here
why does someone here like 'being a dik'?
>>106110835nta, but I resent butholes and poop entirely. I wish they just weren't there. I hate the fact I can smell them during sex.
>>106110843you're not supposed to be able to smell a butthole during sex.. i never do wtf
oh wait damnit did i get baited again
I don't understand the point of the anime thread, it moves at a snails pace and they don't post anything good or new. I feel like this is some sperg lashing out to a small audience and no one cares
>>106110849Yeah, you totally can. Especially during doggy style.
>>106110840i mean its a classic f95 game. i remember its name despite not remembering anything about it
>>106110876no you cant what the actual fuck am i being baited right now? are you white? are the women you fuck white? are the women you fuck under 250 pounds what the fuck bro
>>106110860Every diffusion thread on /g/ is the result of someone sperging out.
the 2 girls beside the racecar crawl onto the hood and wave hello.
actually worked, based mikucar
>>106110882>am i being baited right now?Now I'm the one being baited. I've fucked all kinds and it's a consistent theme. That gross musty ass smell.
>>106110888Checked. Onlyfans and escorts are OVER. (in 2 more weeks)
>>106110641 (OP)( หถหแหหต )Blessed thread of goog quality gens and frenship.( หถหแหหต )
But please help us here
>>106087463 nobody is using this thread, is starving (๏น)
>>106110889bro that's just the smell of sex. it's consistent because its sex wtf keklmao why is your brain telling you it must be the butthole did a butthole kill your parents or something
>>106110889Ask them to wash their butthole before sex what the fuck
>>106110835like many experiences, it completely cured me of my curiosity.
>>106110849a lot of girls don't wash their asses before sex or after shitting. if they shave, shower, and wash it (like pregame with soap and lube), there's no smell, but depending on if they used lidocane gel, your tongue might go numb.
>>106110889if you assume this anon is a homo it makes more sense
>>106110889there's one constant in all of your experiences, can you guess what it is?
>>106110900No, it's not. No it only happens when my nostrils and the asshole are in direct line of site. Any other position is fine.
>>106110900>>106110889>butt smell is the smell of sexNot everyone is a fag. And women's asses smell like heaven.
>>106110914>your tongue might go numbyou reminded me of the wolf of wallstreet scene
we need a lora for that angle, ass eating but woman on hands and need facing the camera
>>106110922ok tell them to wash their ass wtf
>>106110929imagine having reading comprehension this poor
Is this a general of sex havers? Seriously, guys, how do you have sex? It's impossible to understand women, or maybe I understand them very well and they're just terrible people?
When does the next Chroma version release?
>>106110952I do which is why my gens are the best this thread has to offer
>>106110952dont understand women, just find one whose face you enjoy looking at and turns you on when she's acting annoying
>>106110952Nah, everyone is LARPing or a tourist. I'm almost a wizard and sex is impossible. Women have literally walked away on me at parties.
>>106110961Its over for Chroma, over for Comfy, over for e-thots
ngl, I kind of assumed people who have never actually had sex were a running joke. Even if you don't have it often, I kind of assume everyone has had at least one sexual encounter.
>>106110978how much can you bench
>>106110899We can all see you're desperate to get /adt/ killed. It's not working sperg.
>>106110952Don't wash or shower, dig in your nuts fall right in front of a desirable woman and scream Momieeeee like a baby pig
>>106110984>>106110982>>106110978>>106110952This is the AI thread, not being a retarded spammer thread.
Does anyone here like to use AI to make images and videos?
>>106110982you're forgetting that when you think of "the bottom 10% of the population", you're not actually imagining the bottom 10% of the population
>>106111001kek fatty detected
>>106110984Not much. I use dumbells and I'm up to 40. Keep in mind the last party I attended was years ago anon. Like 5 years. What I look like now does not matter. Though women may talk to me more, I still think they seek nothing but attention.
So the low noise model just cleans up the high noise output right? So it doesn't really make sense to apply any LoRAs to the low noise output that aren't the usual speed up ones?
>>106110974>>106110977>>106110978 I feel that it is literally impossible to engage in conversation with them, or if I do, they leave a bitter taste in my heart. It's like talking to someone whose brain is completely warped. After dates, I feel more empty than when I'm solo after dealing with them. I sense major narcissism, total worship of the status quo, and a shallow, materialistic vibe.
the genjam should feature the most shiniest nigbo
yeah, im thinking wan 2.2 is an upgrade.
anime girl turns around, showing her ass as she bends over. she is wearing a white leotard under her black dress.
unironically kill yourself this is my sage space for AI gooning not some faggy shit about having sex with women or lifting weights
>>106111046where is the AI, where is the goon?
None of us would be having this problem if rocket girl didn't have poop stuck to her ass.
>>106111045Very nice. Thanks for posting actual gens.
>>106111045let's be realistic wan 2.2 is mid at best and this is a cherry picked gen that probably took hundreds of tries
>>106111072first try with that 2b pic, using the all in one model. all I changed from the workflow template is changing it to euler a.
https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOne
>>106111028consider evolutionary biology. people who can't sustain attention are more likely to be neglected in a resource constrained environment, especially if they're caring for a child. someone who can tolerate that when being actively driven off is a good candidate for a partner.
if you can look past that and see the person inside, poop mane and all, you're on the right track.
here is the highest t2v 2.2 + lightx2v resolution i can do with my 16gb vram and 32gb of ram
o467lr.mp4
at least 540p is possible for me
>>106111028you sound lovely to be around
take it seriously and bench your body weight
>>106111039>I sense major narcissism, total worship of the status quo, and a shallow, materialistic vibe.yeah its kinda hot. you have to fetish!ze your life dog. also most of those problems go away when you never ever talk t0 women who have tattoos just saying
>>106111046whatever you say n0gen
>>106111098this was me btw but the spam filter was giving me shit for the c-tbox link for some reason now i look dumb and you guys wont listen to my honest and good faith advice :/ i love my wife
I'm not convinced the all in one is a good merge.
>>106111091>consider evolutionary biology.this is exactly what i mean by "yeah its kinda hot". change your viewpoint. these are actually signs of good genes. if you can't believe it fetishize it i dunno
>>106111106i just wish it didnt have pusa like why be so opinionated like that
>>106111106Needs a gguf first
>>106111098Very high clarity.
>>106111104>local diffusion general>let's make it a diary general
why do other boards discriminate against AI images so much?
>>106110840it's a good video game
>>106111106it's a good merge
>0 submissions for genjam
Yeah rip
SILENCE, MORTALS. ANUBIS DECREES THAT THIS IS A FUCKING AI BOARD
>>106111135the anon running it said it was canceled already
>>106111133Yeah the thing is I'm seeing a lot of artifacts in your gen.
>>106111130It's very easy to understand. As much as I like AI, it appears very low effort and is tainted by a certain group of people I don't think I'm allowed to call out on /g/ without getting a warning spamming their shit all day at every opportunity.
>>106111152AI is allowed outside of /g/. The jannies actually purge posts that screech about "spamming" or calling an AI poster a "jeet".
>>106111169How do you know that's the anon running it?
>>106111176how do you know it's not?
>>106111181So basically you're guessing. Good to know.
>>106111165You're misunderstanding me. I never said it wasn't allowed, just why people do not like it.
>>106111187i thought you knew
>>106111106it just werks, plug and play. good enough for early days. that two stage sampler workflow is retarded.
>>106111116newly divorced in my 30s, dating has been amazing. no apps, no websites. i haven't had to pay for a drink in over a year.
>>106111135they just got excited and announced too after the last one. it's a friday night, and classes start back soon. there's a rhythm to successful posting.
>>106111152>a certain group of peoplen...nerds?
>>106111209If you're going to make Yume Miru Kusuri gens, I am surprised you haven't made any for a bully route
A man with dark hair turns into a super saiyan with spiky yellow hair, and a yellow power aura around him.
not quite dbz but kinda wild
>>106111028>Though women may talk to me more, I still think they seek nothing but attention.This, this is the blackest of pills, there is no greater meaning in woman, there is no thinking of community or the good of humanity and the earth, only atention.
>>106111238this general really is full of virgins
>>106111225Ok Todd, please top, I'll buy Skyrim again!
>>106111104>you have to fetish!ze your life dogI'm sorry, but no, I'm going to fetishize empathy, good intentions, honesty, trust, and helping others. Provocative clothing, makeup, painted nails, and waxed bodies are a socially aproved mental illness.
If a waoman want to have my atention they will have to be like virgin mary.
been out of the loop for a little while. So wan 2.2 is the current best vid gen?
Can anything make people throw up yet? I have not been able to get that to look good with any video model.
>>106110914>but depending on if they used lidocane gel, your tongue might go numb.>i hate butts>also i lick themthis whole thing is pretty much no one fucking asked. feel free to stop now my man.
>>106111277I saw a gen of someone puking green goo on george bush during 9/11 so yes.
>>106110820shave your armpits
shave your butthole bush
no one wants to see that greasy lookin thang!
not even YO MAMA!!
>>106110832This is what I want
To take the all the rule34 stuff out there and creating porn clips of it
>>106111279what does a virgin who can't read timestamps turn into when they're 30?
a necromancer, apparently
Thoughts on the new 2.2 nsfw model? Anyone tried it yet? Gens?
>>106111298All NSFW models that come out within days of the base being released are just shitty LoRA merges and should be ignored.
>>106111303wasn't asking you wanschizo
Anyone using wan 2.2 Q8 .gguf files? What CLIP and VAE are you using? I keep getting errors.
>Given groups=1, weight of size [5120, 36, 1, 2, 2], expected input[1, 32, 31, 84, 64] to have 36 channels, but got 32 channels instead
Reddit says delete "flow2-wan-video" node, but it isn't installed, and isn't in my custom nodes folder.
>>106110952This is unironically a better and more fun use of my time than continuing to chase after used up fleshbags who I never really had much of a chance with.
>>106111098Can 12gb generate lower resolution clips?
>>106111225https://huggingface.co/Bisre/dbz-fight-style-lora
A man with dark hair turns into a super saiyan from dragonball z with spiky yellow hair, and a yellow power aura is around him with lightning crashing around him.
kek
should've specified I wanted scifi...
Has anyone touched LoRA training for Wan yet? I'm wondering if you're supposed to train the high noise or the low noise.
a red hair anime girl is painting a picture of Miku Hatsune.
the gen was done with illustrious but the painting motion is pretty good.
>>106111435Did you try asking google gemini to give you img2vid prompts? Is very good at it
is there an easy way to get into video2video?
>>106110641 (OP)>2.1 Guide: https://rentry.org/wan21kjguideFor fuck's sake why is this still in the OP? It's outdated as fuck. It's not even recommending the latest lightx2v.
kino
md5: ab72d3fc0faecbc986d448c1089a9af5
๐
cinema general
>>106111513bee the change you want to see in the world.
>>106111483go back to /adt/
>>106111298tried it, the creator is right it's still fucking raw
>>106111525shut up,
>>106111515 the cuck porn is about to start
>blocks.0.cross_attn.k_img.weight
What was causing this? I forgot.
>>106111519basedo
>something about crows feet>that ol' dustbowl look>those tired eyescute, CUTE!!!
>>106111594i wonder if he would laugh at these or be visibly upset.....
>>106111592Cute! Can you make her get hit by a truck?
>>106111620technically yes ;3
>>106111277>Can anything make people throw up yet?yeah, the gens in /sdg/
result of that last lora bake for Chroma, my usual lora approach from past models isnt holding up
>>106111684>It's another chroma didn't live up the promises postUgh, I don't think I can take another plate of "I told you so"s
>>106111684and same seed & prompt from the flux lora i baked (Same dataset)
Can you guys just give pony another chance? The prejudice against it here is ridiculous.
>>106111673crt delivery truck <3
>>106111726The problem with pony is that by the time the creator is done checking the license of the base model they chose, it's already outdated
>the genjam wheel actually rolled anime, lain and schizo
>>106110641 (OP)Just letting you know I appreciate these collated webms in the OP. Always full of cool new shit to look at.
>>106111726pony eyes are instantly recognizable, you can never unsee them, and it ruins 99% of gens from that point onward.
>>106111435>>106111460ok what did you prompt for the hairspirals
i have a character i was trying to generate
but the danbooru tags aren't working w\ my checkpoints\lora
>>106111811>what did you prompt for the hairspirals>retarded namefag doesn't know his vocaloids
Update the fucking wan rentry
f2-5
md5: 425e905e854426c5ccd5f6fa51c4a1a0
๐
AI artist here again, I'm going to share more of my new discoveries in my experience with Anime Genning.
>novaAnimeXL_ilV90
Works with Forge Couple for multi-girl. Don't overprompt or it shits the bed.
>Inpainting
Use actual inpaint models or CN inpaint, not your base model + mask like a retard. Match styles or enjoy your obvious seams.
>LoRA/Prompting
ComfyUI doesn't parse lora:name:weight in prompts
Load via nodes, newfag. Only use trigger words in prompt.
>Tags
Text encoders understand synonyms, booru autism not required. But "young" tag gives you calarts bean mouth cancer, avoid.
>Adetailer
Face > eyes only. Eyes-only = color drift. Bad LoRAs = persistent eye AIDS. Train your own or cope.
>Regional Prompting (Forge Couple)
Image breaks with too many attributes
You're overloading it, retard. Keep regions simple:
Less attributes per region
Global stuff in main prompt
Lower CFG (5-6)
Use CN for layout first
Hires Fix crashes
Known issue. Either:
Upscale separately (based)
Use ComfyUI two-stage (better)
Cry about it (you)
Upscaling
Extras tab = placebo. Use:
Latent upscale + refiner
4x-UltraSharp/ESRGAN anime
Low denoise inpaint pass after
Captioning (for training)
>Local options that don't suck:
qwenvl2.5-3b (fast, decent)
ernie-4.5-28b (VRAM pig)
torii models (NSFW chad)
ERNIE 424 (best but will melt your 4090)
tl;dr
TLDR?
Eye problems = shit LoRA
Regional prompting breaks = you're doing too much
Inpaint properly or enjoy your seams
ComfyUI > * if you're not retarded
Stop using 20 style tags per region
>IMPORTANT:
I will be here for the next hour
Post your setup and what you're trying to gen if you want actual help instead of generic advice.
>>106111832>ComfyUI doesn't parse lora:name:weight in promptsThere's a custom node that does this. lora loader + autocomplete will make it easier.
Does radial attention actually works?
Why was RadialAttn such a huge disappointment? Why can't they get it right?
I lost my job because of radial attention.
>>106111845yes
>>106111855Too finicky. Only supports very specific resolutions
>>106111726I redid some old pony prompts with my current workflow and they turned out surprisingly good. however, pony has some serious fucking problems:
>the pony look>way worse pose control and comprehension in general than noob>obfuscated artists>limited stylesunless pony has something very specific you want, there's 0 reason to use it
488
md5: 99b62d617630336ec55d95fa6ec51a95
๐
How do I make this shit work?
>>106111726I never bothered learning how to prompt pony. The only reason I'd use it is because it has some really really good realism models.
>>106111896disable speedups and see what happens
>>106111822v helpful anon thnx!! ;3
>>106110641 (OP)Anyone have a link to the full size pink anime girl with big booba webm?
>>106111911It was the 2.2 model. It refused to work it. But 2.1 was fine. What an ass workflow
>>106111832Do you have a rough estimate how long q Video model would need to train on a Framework Desktop 128GB AI Max 395+?
Ist in the realm of days, weeks, months? I have approx 14TB of training videos, but it's not tagged.
>>106111899even for photorealism, I think illustrious models are competitive with the pony ones now. hard to tell though, since all of the examples on civit are slopped as hell
>>106111888Not Wan, Chroma
>wan 2.2
>10 steps takes 4 minutes
>15 steps takes 3 minutes
how did they do it??
>>106111896model loader is for loading safetensors, you need to use the gguf loader for gguf files
>>106111671dang it. I opened /sdg/ to look before I got it.
>>106111866More like you lost your wife and children because of fag attention
>>106110641 (OP)ALL THESE MODELS SUCK HOW DO YOU GEN SOMETHING NICE? I HAVE COMFYUI I TRIED FLUX, TRIED SDXL TRIED REALISM THEY ALL GEN GARBLY GOOP GARBAGE AND ALL REFUSE TO DO NUDE WTF???
>>106112101I TOLD SDXL TO GEN "A BANANA" THIS IS WHAT IT MADE. IT CAN'T DO ANYTHING RIGHT
>>106112101>>106112109SD 1.4 is what you need
>>106112114THERE IS NO 1.4 IN THE TEMPLATES AND I THINK THAT WOULD BE A DOWNGRADE I NEED BETTER NOT WORSE
ALSO WHY DOES FLUX HAVE TWO POSITIVE PROMPTS? WHAT THE FUCK IS THAT ABOUT?
>>106112132LOOK AT THIS SHIT I CLEARLY TYPED NIGGER AND IT STILL GEN A WHITE FAIRY
>>106110641 (OP)Can this new wan2.2 rentry be in the OP: https://rentry.org/wan22ldgguide
It needs a lot of work still, but it's basically a modified version of the 2.1 guide, except with updated links and workflow.
Using kijai's example workflow, except without the retardation (like having save_output turned off)
>>106112173Good work anon!
>>106112173Good job, a few points tough:
1- The title is still "Wan 2.1"
2 - No mention of WAN2.2-14B-Rapid-AllInOne that some anons are using:
https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOne
>>106112201>1- The title is still "Wan 2.1"fixed.
>2 - No mention of WAN2.2-14B-Rapid-AllInOne that some anons are using: https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOneI tried it and got bad results but maybe it's good for others. Feel free to write a segment on it and I'll include it.
Again, very rough and early stage for this new rentry. I just think it's important to update all the old information so no newcomers end up using old workflows.
>>106112225only thing i'd say is to make a json for a t2v workflow as well in the lightx2v section
because idk it just feels incomplete to me to not have one for both t2v and i2v
>>106112280What model is this
>>106112300Which version? If it's regular dev Chroma can do all that just fine.
>>106112280Good, that means it doesn't waste latent space on shit
can anyone share an optimized Krea workflow? been a while, i'm not used flux
>>106112304regular dev
> Chroma can do all that just fineno, pic related is the best i got, it is never able to get good consistency even when explicitly prompt 2x2 grid with the logo for each square
>WanWrapper updated
>I2V started hallucinating shit on input images with white background
aaaaah
is krea a game changer or a meme
>>106112331Oh I see, you want specifically a 2x2 grid. Dev is already really good at this type of stuff, Chroma wasn't really made for purpose of graphic design alone so I guess that's one area where it possibly regressed.
>>106112283Kiji's 5B workflow isn't loading for me in comfy and he doesn't have a 2_2 14b t2v workflow on his github.
Know a good one? Otherwise i'll just use bullerwins I guess
>>106112350meme, it removes a lot of the plastic skin and Flux chin, but it also loses the 'almost always good anatomy' which was the one saving grace of Flux dev
it's rather pointless, and since doing any nsfw finetuning on it is a breach of its license, it's totally DOA
Wan and Chroma are the image models of any interest at this point
Chroma data preprocessing is fundamentally fucked.
If you try to generate a circular object/logo it's frequently cut off from right and bottom.
It's obvious that images are cropped from the top left when doing training, instead of doing proper downscale.
>>106112408Similarly neta lumina data preprocessing is also fundamentally fucked.
Notice the horrible aliasing on the thin outlines here that happens frequently
It's obvious that images are downscaled with nearest neighbor or something similar (which produces aliasing) when doing training, instead of doing proper downscale.
>>106112400I dunno why people are complaining about Krea anatomy, it's been quite good for that for me. Are they using low-end GGUFs or something?
>>106112015Illustrious is MUCH harder to train realism into than Noob, at least lora-wise, I've found. Both are way harder than Pony for that though, it clearly had way more realistic data in it to begin with.
>>106112408How would you propose automated scaling and resampling of images that almost fit inside a bucket, but not quite? It's either a slight crop or adding fuckugly letterboxes, and letterboxes in datasets are the worst. I can live with cropped circles.
0_0
md5: 0b06a332154f0f9998b1d9fbe76d5589
๐
>>106111845I couldnt get it to work, anyone else possibly? We'll soon have 3 versions in total:
Kijais version (never tried this, I only use native)
>https://github.com/kijai/ComfyUI-WanVideoWrapperwoct0rdho version
>https://github.com/woct0rdho/ComfyUI-RadialAttnThe real version (yet to release)
>https://github.com/mit-han-lab/radial-attention
>>106112173>>106112395Also I just created a hackmd version that anyone can edit:
https://hackmd.io/RDxlWe8mQCSUi72yUDEzeg?both
Welcoming all edits and contributions to the new wan rentry guide.
>>106112400>and since doing any nsfw finetuning on it is a breach of its license, it's totally DOAThat's not in the license at all, what are you talking about
Also how are you complaining about anatomy while praising Chroma lol, it has worse anatomy than the original Flux Schnell did most of the time
>>106112465Crop at center, and if something looks cut off after that, describe as cropped in the text prompt.
>>106112280Chroma gen (also catbox appears to be down so can't share)
But here was prompt
>This image displays a collection of four globally recognized brand logos, arranged in a two-by-two grid on pastel-colored backgrounds.>Top Left: The iconic, cursive red script of the Coca-Cola logo is sits on a plain, light background.>Top Right: The unmistakable yellow "Golden Arches" of McDonald's form a bold 'M' against a solid red square.>Bottom Left: The wordmark for the gas company Mobil is shown in blue, with its signature red 'o' providing a pop of color.>Bottom Right: The logo for the convenience store 7-Eleven is displayed, featuring its well-known design with a green border, an orange and red numeral '7', and the word 'ELEVEN' across the middle.
>>106112465The only way around this is to fully use synthetic data, like Flux dev, but then you get the plastic skin uncanny samefaces.
You could create a synthetic dataset specifically for logo stuff etc, but really, who the fuck cares about that in a base model 99% of its use will be hot realistic females, anime, fantasy / sci-fi art ?
If you need good quality logos it is easy to train a lora for that use and the results will be much better.
Make the model as good as possible for 99% of what it will be used for = win.
>>106112465whatever it is that Kohya does seems to work completely perfectly 100% of the time for Loras
>ram at 60%
>vram at 80%
>dies when entering VAE from the second sampler
The kijai workflow is PURE fucking ass
>>106112481>Also how are you complaining about anatomy while praising Chroma lol, it has worse anatomy than the original Flux Schnell did most of the timeNo it doesn't, stop making shit up.
>>106112280>>106112331>>106112495Wait a minute this isn't art, you just generated a bunch of corporate logos!
Apparently faster than NAG
>This node implements Value Sign Flip (VSF) for negative guidance without CFG in ComfyUI. It is designed for object removal in video generation (e.g., removing bike wheels), not for quality improvement. Using prompts like "low quality" as negative could increase quality, but could also decrease it.
https://github.com/weathon/VSF/tree/main/comfyui/custom_nodes/value_sign_flip
https://www.reddit.com/r/StableDiffusion/comments/1mfh3e8/vsf_negative_guidance_for_wan_t2i/
>>106112600It's the Chroma hater guy, he is desperately trying to find some way to attack it, now he is at the stage where he complains over it not generating logos good enough.
I used to think he was just mentally ill, but given how he always defends BFL no matter what, he must be some kind of shill.
>>106112484Illustrations shouldn't be cropped at center, this will cut off the head aka sd1.2. Cropping should center around YOLO-detected focal point, and that will introduce asymmetry.
I cannot believe how fast AI porn is evolving. This shit will replace real porn in a couple years.
00307
md5: 2aaeb4709a8fb02cbb752f0004105cf4
๐
>>106112521of course dr steve would post here lmao.
>>106112625only if it's big and black
>>106112539yeah it does man, the hands in it are generally awful without serious schizo negatives and luck
>>106111296>timestampsnot even you know what your point is. shut the fuck up and post titties. it's all you're good for, and even then only barely.
How's the quality of the AIO Wan comapred to urnning double q8?
>>106112600>this isn't artHow many corporate logos can you draw from memory? Art or not, this is impressive.
>>106112622who the fuck is dr steve? my preggo cortanas are mine.
>>106112725>How many corporate logos can you draw from memorywhy would i want to do that?
>>106112670The model was trained to be used with negatives, just like SDXL, SD15, SD14
If they didn't want negatives, they would have trained a fake cfg distilled version, like Flux dev and Flux schnell
I'm glad they give you negatives, because gives tremedous increase in control, retards like you should stay in the BFL sandbox
>>106112739The point is it needs too many and it's just very obvious that most of the dataset is NOT photographic at all, like I said Chroma is good for other stuff, it's not good at realistic gens without a lot of hassle
I got wan 2.2 running. AMD GPU 16GB VRAM, Linux, torch compile for the models and VAE. Performance differences:
2.2
>Have to run Q3>220/it>can run Q8>140/it2. looks worse
2.2 is also significantly worse quality due to Q3. I am running lightx2v lora.
>>106112141kek
>>106112768>AMD GPU 16GB VRAM, >Have to run Q3Is this an AMD moment because I run Q8 with a 4070S
>>106112400Distillation kills its purpose for me. If it were undistilled, maybe then it'd hold a candle to Chroma. Fake skin can't be defeated with distillation, distillation itself causes it.
>>106112670You clearly haven't tested the model thoroughly to arrive at such a conclusion.
>>106112604>VSF is better when:>You want simplicity>You're doing standard negative promptingopus seems to think this shit is better than nag. gotta try it out, good looking out hommie.
Am I supposed to split the steps 50/50 between the low noise and high noise models?
I've been out of loop. what is flux krea and how is it different to flux dev. is it just a different checkpoint or is it mechanically different and need different workflow?
>>106112850focused on realism, they tried to train out 'ai look'. they failed, but it is still neat if you like 3dpg gens.
>>106112850it's a photographic finetune of the raw Dev weights, runs the same way
>>106112759>The point is it needs too manyAccording to what authority ? Doesn't need more than any other model with negatives.
I typically use ~8 to 12 negatives depending on content and what I want to remove
Here's an example:
prompt: female large breasts movie film still southern belle dress american south plantation
negatives: low quality, ugly, unfinished, out of focus, deformed, disfigured, blurry, cropped, necklace
Using a 512 resolution (which is low) Chroma test rank 16 lora of Bryce Dallas
I've trained on all models from sd15 forward, Chroma is the easiest to train and gets the best results on people, and equally if not better on artstyles, although admittedly I have only trained a few styles as of yet.
Can someone update the wan entry?
I'm sure you didn't do it yourself, right? You stole it from somewhere else.
>>106112897At this point I'm sure he's just baiting. You can tell especially because he claims Chroma wasn't trained on photorealism (the one thing it excels at) compared to other models. Utter nonsense.
>>106112943questionable crease
>>106112943The anatomy on this is all manners of fucked up
tried updating to wan 2.2 and it's been kicking my ass
on the one hand it makes loras give way better results even when they don't really adhere to prompts, but on the other hand, they been looking weird like this
any reason why? it kinda looks like it's not done denoising or has enough steps
diffuse model is wan2.2_t2v_low_noise_14B_fp8_scaled
vae is wan2.1
got a 4070 12GB
workflow is the one from the rentry
>>106112956>questionable creaseJust in case, shemale is in negative since I'm asking for the girl to be athletic
>>106112943Yes you are probably right, outside of Wan, no other model I've used comes close to Chroma when it comes to photorealism, and it also handles tons of different photo / film styles.
Here's my attempt at using Chroma to mimick 70s sexploitation style taking place in the south during the plantation era.
Again these are all using low 512 resolution rank 16 loras, once Chroma final is released I will retrain these in 1024 rank 32-64
>>106110641 (OP)Your local diffusion model completely fucked up the Giger art BTW. It was supposed to be bullets in a magazine, not a razor keyboard.
Remember to help out with the new wan rentry if you can:
>>106112477
>>106112768You're doing something wrong, 2.2 and 2.1 have the same speed and memory consumption.
>>106112943this details on the background here are nice but the girl looks like something out of a good Pony or BigASP based model, super messy details. And six fingers on one hand. This is exactly the sort of thing I mean, it's a good model but there's just obviously more non-photographic data than not (which is exactly what you'd expect from Lodestones, the guy who also made Fluffyrock).
I remember reading that Neta lost funding so the V1 model is the final release. But then they claim that one can sign up for beta releases, which wouldn't make sense if they lacked funds/GPU hours.
Is Neta dead?
>>106111296My beautiful nubile daughter
>>106112477I wish I could but esl
>>106112500He does it the same way Chroma does.
>>106113028What is Neta ?
>finally upgraded
This GenJam is mine.
>>106113073Show me the specs or I refuse to believe you!
/adt/ is not doing collage. do you still need to do that cringy stuff?
>>106113086maybe once i figure out why the wheels fail to build
>>106112960Chroma does anatomy fine anon, better than any other model out there.
>>106112999Yes, Chroma is quite impressive, no other model compares not only to its photorealism but also scene coherence with complex prompts, and it's the only model I can trust has seen enough naked women and regular looking images that it can pull them off in any position or situation, both SFW or NSFW, while still looking realistic. And if your prompt is good enough, it probably gens better images because Wan needs a proper Chroma style tune.
>>106113140do they have anything to put in a collage
>>106113149While Chroma is slower than distilled models like Flux dev and Flux Schnell, it is faster than Wan, and Wan needs higher resolutions than Chroma to look good, and training loras for Wan is much slower and hardware demanding than Chroma.
So overall Chroma wins for me, but I honestly don't care much which one comes out on top in community support, both are great.
>>106113146Have you upgraded to a 50xx Nvidia card ? If so you need Pytorch 2.7 or later and also bundled with Cuda 12.8 or later
>>106112960Are you sure you want to argue with him? He is abnormally dedicated and will outdo you through sheer online presence, even if his taste in images is questionable.
I almost didn't realize this is how you're supposed to load Flux loras when using Nunchaku. I hope no one else is accidentally overlooking this.
The other day I was going to start using Illustrious for SFW generations but with improved Flux speed I think the tiers might be:
Nunchaku Flux Loras > Illustrious > Flux Loras > Flux
I just can't deal with the slow speed of Flux. Wan image generation is even worse.
I have a 5090, but just curious, how fast can the apple ARM chips run the larger wan models in it/s ?
Are WAN loras trained on 3dpd going to bleed into anime gens?
>>106113156does this thread?
>>106113211i have both. downgrading python like https://github.com/comfyanonymous/ComfyUI/issues/7744#issuecomment-3139344136 suggests gets it to build but when running main.py it returns no cuda gpu available. its strange, fastfetch shows the gpu but the display is listed as "unknown", also it takes a couple of moments after boot for the screen to stop lagging for lack of a better term. even then, i feel like something amiss with the driver even though i have the nvidia package installed. im also on lts kernel
i dunno
>>106113252Yes, that is a standard side effect. You can adjust the weights to your liking.
Personally I prefer not to use loras for anime but sometimes you have to for genitals.
>>106113192There is an advantage to Wan though. Since its base is so strong I imagine it would be much faster to converge than Chroma, like 5 epochs fast
>>106113284yes
just one R guy worth it
>>106113252It's actually a lot easier with a dual sampler setup. You can use a lora at higher strength for the 1st ksampler when general shapes and movements are forming, and then use it at lower strength for the 2nd ksampler when the style forms
With Wan 2.2 i2v and lightx2v what step values to use in ksamplers?
You can just denoise full 4/4 with low noise checkpoint and the output will not look worse than 2.1 at least. I'd say for dynamics it will even look better than using high noise model first.
>>106113357Might be, I was waiting on Wan 2.2 to try training a lora (for text to image, not video), and Diffusion-Pipe has Wan 2.2 support now so maybe it's time to try.
The guy who did a flurry of quality text to image Wan 2.1 loras stated that he never trains on more (or less) than 18 images, not sure why he set such a arbitrary limit, but you can't argue with the results:
https://civitai.com/models/1773251/wan21-classic-90s-film-aesthetic-the-crow-style
https://civitai.com/models/1767169/wan21-nausicaa-ghibli-style
I noted when I tried these that you really need resolutions in the ~1600-1900 range for the gens to look really good, so you better have patience
>>106113359cringe as I said
>>106113327nvidia-open-beta doesn't work either. the wheels for sentencepiece refuse to build aaa
>>106112350>kreaLeft is krea right is flux dev. Both are fp8 scaled.
If you were using flux without loras you should change to krea. It is much more aesthetic. Other than this use case I think it took them too long for it to be relevant. I only use photorealism sparingly I am keeping the krea model for that, and I might as well delete flux.dev as I will only use Chroma for painting like gens.